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Editor's Preface 


Measurement systems have been in use since the times of earliest man. However, 
only in recent decades has it become clear to many that there exists an under- 
lying collection of fundamentals that is applicable, in part or whole, to all 
measurement situations regardless of how diverse the applications appear to be. 

The 1960’s saw the start of many discussions, at conferences and the like, on 
what constitutes this set of knowledge. By the 1970’s the situation was such that 
there was debate only on the finer points of what to include, not the core 
material. 

Around 1976 1 felt the need to resolve the issue, in the spirit of measurement, 
by producing a printed statement that would act as a standard for future use 
and comparison. Subsequently I put a proposal to my colleagues on the 
Higher Education Committee (TC-I) of the International Measurement 
Confederation IMEKO that the Committee should produce such a statement 
of the fundamentals involved. It was decided that I should personally produce 
such a Handbook using committee-men and other persons to help me design 
the structure and assist with contributions. 

In 1977 I circularized a possible structure of some 36 chapters, along with 
brief abstracts, for comment. (This was realized by sorting my personal collec- 
tion of texts, reprints, lecture notes, and experience after removal of works 
concerned with application only.) The fifteen experts who studied the proposal 
agreed, with a few minor changes, that the format represented the fundamental 
material of measurement systems. In this way it was resolved that measurement 
science had reached the degree of stability of content needed for a topic 
to become teachable and usable in a systematic rigorous manner. A simulta- 
neous endeavour worthy of mention here as it supports the sentiment of this 
Handbook is the liistrurnem Science series of solicited review articles run 
m the Journal of Physics E: Scientific Instruments. (Now published (1982) as 
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‘Instrument Science and Technology Volume 1’ B E Jones (ed ) Adam Hilger, 
Bristol) After incorporation of the suggestions received a general outline docu- 
ment was prepared and sent to invited contributors 

This Handbook provides what the majority of the many hundreds of books 
on measurement do not— the basic endunng fundamentals of design It does 
not seek to provide as its mam theme, a summary or a catalogue, of instrument 
application D M Considme, in the preface for his Process Instruments and 
Control Handbook stated ‘the work was a step forward (in 1957) in the matur- 
ation of a mature science’ That publication related very much to the practice 
of instrumentation It is my belief that this book will be another key document 
in the maturity of measurement systems design, one laying down fundamental 
design building principles 

The contents are structured according to the philosophical sequence in 
which a measurement system is first conceived, then designed, made, installed, 
and maintained The material divides conveniently into subjects concerned 
principally with theoretical principles needed to design measurement systems— 
this constitutes Volume 1— and fundamental material concerned with more 
specific design, application, and maintenance of measurement systems— 
Volume 2 

Volume 1 begins with the theoretical basics of measurement, the basic 
interface existing between the problem and the designer Conceptual under- 
standing IS then continued in Chapter 2 followed, in Chapter 3, by an introduc- 
tion to the terminology used 

Theory of signals is given m Chapter4 providing a point of reference to much 
of the signal theory involved in realizing measurement system stages The 
increasing importance of digital signals is recognized by the inclusion of 
Chapter 5 Error in measurement is covered in Chapter 6 

In a measurement situation the fundamental need is to be able to map one 
parameter of the system of interest into a mathematical equivalent In pattern 
recognition (Chapter 7) the need is to establish a many-to-one mapping and as 
such this subject is most important in measurement systems design Chapter 8, 
on parameter estimation, is concerned with realization of measurement values 
m situations where randomness is prevalent Signals of the analog or digital 
form invariably need some form of selective spectral processing as they are 
conveyed through the measurement system chain and Chapters 9 and 10 deal 
with this 

Despite the existence of many texts on instrumentation almost none of them 
address the problem of extracting signals m the presence of noise sources at 
any useful depth Chapter 1 1 is, in this way a unique contribution Chapter 12 
introduces the Handbook user to the methodology of converting signals 
between the analog and digital signal format domains 

A most valuable feature of electronically based measurement systems, 
compared with the alternatives, is that the sensors can be virtually any distance 
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Converting a sheaf of original lypescnpt and diagrams into a consistent well 
produced Handbook is a task equally as great as writing it My appreciation is 
extended to the production team of John Wiley and Sons at their Chichester, 
UK, location 


P H Sydenham 

Adelaide 
October, 1981 
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Chapter 

1 L. FINKELSTEIN 


Theory and Philosophy 
of Measurement 


Editorial introduction 

Measurement is a fundamental procedure in obtaining knowledge and in controlling 
systems. Numerous texts and published papers are devoted to its practice, a pursuit that 
dates since earliest man and is followed in all manner of human enterprise. Yet surprisingly 
we are only just beginning to try to understand the philosophical processes occurring at 
the measurement interface expressing these in terms of a formalized mathematical model. 

Formalization with mathematics enables a topic to be mechanized with man-made 
machines— such as 19th century Boolean algebra that eventually led to today’s digital 
computer systems. It also enables refinement of procedures. When we have developed a 
satisfactory formal approach to all measurement, and when that is widely understood, we 
should be able to be more efficient in designing and using measurement systems and in 
reworking data collected in the past. 

This chapter reviews the advances made in measurement theory that can be readily 
understood and applied by the practitioner. It also analyzes why measurement is a funda- 
mental method of science and presents a critical review of what constitutes a measurement. 
The gap between the abstract philosopher’s approach and that of the pragmatic instrument 
designer is gradually being bridged. This aspect of measurement systems is probably the 
least taught, least understood, and least heeded by physical hardware designers. It is the 
key to real success if advances in application (especially in hardware capability) are to be 
given freedom to be used. Philosophical design strategies are needed urgently, strategies 
that can be applied by non-specializing engineers and scientists. 

Of the several possible ways in which a measurement interface can be modelled, the 
mathematical one appears to be the most basic and useful in the long term. In this case, 
modelling is done in terms of set theory. To assist, an appendix is presented in which the 
relevant theory is outlined; Lin and Lin (1974) is also useful. 

As will be seen in this chapter just what constitutes a measurement and what kinds of 
events can be measured are still very much matters for debate. This reflects the embryonic 
nature of the philosophy of measurement and its, as yet, paucity of exposure in areas of 
measurement practice. Other subsequent chapters also depict other descriptions of the 
measurement process. Each has something to contribute to assist design or application of 
measurement systems. 



2 


HANDBOOK OF MEASUREMENT SCIENCE 


1.1 INTRODUCTION 

Measurement is the process of empincal, objective assignment of numbers to 
the properties of objects and events of the real world m such a way as to describe 
them 

Measurement is the most fundamental method of science Firstly, science 
aims at an objective empirical description of the universe and thus measurement 
of what IS observed is the goal towards which scientific investigation is directed 
Galileo Galiliei expressed this when he made his programmatic statement 
Count what is countable, measure what is measurable and what is not measur- 
able, make measurable ’ Measurement enables the laws and theories of science 
to be expressed in the precise and concise language of mathematics Science aims 
at describing complete domains of knowledge using measured data expressed in 
mathematical formalism The mathematical description of knowledge is said to 
be the hallmark of true science This view is usually expressed in the often quoted 
statement of Lord Kelvin ‘I often say that when you can measure what you are 
speaking about, and express it in numbers you know something about it, but 
when you cannot measure it, when you cannot express it m numbers your 
knowledge is of a meagre and unsatisfactory kind it may be the beginning of 
knowledge, but you have scarcely, in your thoughts, advanced to the stage of 
science whatever the matter may be ’ This strict formulation of the essentiality 
of measurement is often disput^ by those engaged in such fields as the social 
and behavioural sciences, where the problems of measurement are difficult and 
there IS much objective empirical observation and qualitative theorizing without 
measurement being possible It may be questioned whether the physical sciences 
which are totally based on measurement and mathematical formalism are 
suitable paradigms for other domains of knowledge However the universal 
importance of measurement cannot be disputed These, and other fundamental 
issues are also discussed in Sydenham (1979) 

When the property of an object or event is characterized by a number, this 
number carries information about the property Modem technology has made 
immense strides in the development of instrumental means of information 
ac^i^'/sii.'on fr-cm cbfecissftd ThanforinsCKfn is cno3<ded at the 

form of a physical signal and can be processed by a variety of information 
machines The information can be output in the form of a number representing 
a physical property, m other words a measure, or used for decision or control 
These powerful modem means of information acquisition and processing, 
constitute the nerves and brains of an immense variety of modern technical 
systems from chemical and electricity generating plant to aircraft and space 
vehicles Measurement and related processes have thus acquired a vital techno- 
logical importance Fmkelstem (1977) further expands these aspects 

Measurement is thus universal and all pervasive As such the process of 
measurement of physical properties seems intuitively obvious We learn to 
measure properties such as length or mass as children We find no conceptual 
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problem in measuring the length of an object as say 15 mm or interpreting or 
handling the information. For this reason most textbooks of the physical sciences 
or technology gave, refer to Sydenham (1979), virtually no attention to the 
definition or analysis of the concept of measurement. It may well be questioned 
what place a discussion of the philosophy of measurement has in a handbook 
such as this, substantially devoted to practical problems of physical measure- 
ment. However the foundational concepts of measurement are both of basic 
importance and interest as well as of some practical significance. 

It is only necessary to look at some simple physical measurements to see that 
the concept of measurement is not trivially obvious. If we state that the volume 
of one object is two times that of another, what is the interpretation of two 
times, since division is an operation defined for numbers not objects? What 
about the statement that the temperature of one object is twice that of 
another? 

Besides those physical properties for which there are well defined measure- 
ment scales, there are others for which suitable measures are more problematic. 
Hardness is an obvious example. There are many properties of materials, for 
instance, which are of technical importance and for which suitable scales of 
measurement are difficult to establish. To give a few examples, we have the 
‘spreadability’ of butter, the ‘foldability’ of paper and the ‘strength’ of coal, that 
is the ease with which it can be cut or worked by tools. The theory of measure- 
ment may give an approach to the procedures for the setting up of suitable 
scales of measurement in such cases. 

It is the social and behavioural sciences and their managerial application, 
which give particularly good examples of practical and philosophical problems 
in the formation of scales of measurement. Starting from psychological proper- 
ties like taste and smell, the measurement of which may be of great technical 
importance, we can quote others such as intelligence or alienation of great 
theoretical and practical interest. The level of conflict in a community or the 
standard of living are basic concepts in our study of society, the measurement of 
which presents problems. Finally, even the apparently exact managerial 
accountancy techniques present us with conceptual difficulties in the measure- 
ment of the level of profit. 

In addition to the practical significance of the theory of measurement, it has 
an intrinsic philosophical importance. As the basic means for our understanding 
of the universe, it is essential that the nature of measurement be understood. 
Tlie last hundred years have seen a powerful and fruitful development of the 
understanding of the foundations of mathematics and logic, sciences which in 
the past were based on vague and intuitive foundations. The theory and philos- 
ophy of measurement can be seen as part of that development. 

Tills chapter gives a brief outline of the foundational concepts of measurement 
and their philosophical background. It will as far as possible attempt to be 
simple in its presentation of concepts, while giving the essential aspects of formal 
measurement theory. 
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\2 OUTLINE OF HISTORICAL DEVELOPMENT OF 
MEASUREMENT THEORY 

The anaent Greeks were the first to investigate the philosophical foundations 
of measurement through the practical pursuit of measurement m crafts and 
trade that arose several millema earlier m the course of the Urban Revolution 
m Mesopotarma and Egypt The school of Pythagoras was concerned with the 
philosophy of the relation between numbers and the real world and hoped to 
make arithmetic the fundamental study in ph^ics Plato’s Academy developed 
an extensive theory of magnitudes, thou^ it was concerned more with the 
nature of numbers than with an analysis of the nature of measurement Aristotle 
studied the concepts of measurement in his Metaphysics (Greek writings are 
extensively translated into the English language m such works as the Loeb 
classics ) 

The Middle Ages saw much scholarly study of the theory of measurement 
though not concerned with the apphcation of measurement to scientific observa- 
tion With the rise of modern science, Newton m his development of mechanics 
provided the first comprehensive mathematical theory of a domain of physics 
In his Arithmetica Umiersahs he developed a theory of magnitudes based on 
arithmetic 

The true foundations of the modem theory of measurement were laid by 
Helmholtz ( 1 887) in a thorough logical analysis of the epistemology of counting 
and measurmg The work was part of the beginning of studies in the logical 
foundations of mathematics with which the theory of measurement is closely 
connected An important development of the work of Helmholtz was the 
axiomatization of measurement of additive quantities by Holder (1901) 

The British physicist, N R Campbell provided a lucid and thorough analysis 
(Campbell, 1920) of the fundamental basis of the measurement of physical 
quantities in his Physics the Elements The book can still be read with pleasure 
and profit Campbell’s theory was based on the measurement of physical 
properties for which an empmical operation of addition could be constructed, 
what IS now known as extensive measurement The theory became generally 
accepted under the influence of logical positivism (Cohen and Nagel, 1934, 
Hempel, 1952) 

The theory of measurement from Helmholtz to Campbell and those who 
developed their work, was concerned with physical measurements, which are 
based on additive quantities and quantities derived from them In the social 
and behavioural sciences, however, there are many properties which cannot be 
empirically added nor denved from additive quantities This created great 
philosophical difficulty A report of a committee of the Bntish Association for 
the Advancement of Saence which considered quantitative methods rejected 
measurements not bas«i on additmty, and hence the possibility of psychological 
measurements, and even questioned the status of the thermodynamic scale of 
temperature 
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These rigid positions of the classical theory of measurement were broken 
down by work in the social and behavioural sciences. The concern of the social 
sciences with the concept of utility led through the work of the early writers 
Bentham and Pareto to the classical work of von Neumann and Morgenstern 
(1944). Their axiomatic formulation of the theory of utility has been the basis 
of much work in the theory and practice of measurement in the social sciences. 
In psychology S. S. Stevens carried out much fundamental work on developing 
an appropriate analysis of the nature of measurement (Stevens, 1946, 1951, 
1959). Other important workers who should be mentioned are Torgerson 
(1958), who presented an excellent exposition of the fundamentals of measure- 
ment and scaling, and Coombs (1964) who developed a theory of data. 

The proceedings of a conference in the United States (Churchman and 
Ratoosh, 1959) presented a review of the classical approaches to measurement 
as extended to needs of the social and behavioural sciences. Ellis (1966) is a most 
useful and interesting philosophical analysis of measurement. It is not formal in 
approach and is principally concerned with physical measurement though it 
takes into account non-classical theory. 

Modern measurement theory may be said to originate from the work Tarski 
on relational systems and model theory (Tarski, 1954). The theory which may 
be termed representational theory of measurement considers measurement, 
loosely speaking as the establishment of a correspondence between a set of 
manifestations of a property and the relations between them and a set of numbers 
and the relations between them. Suppes and Zinnes (1965) provide a clear 
exposition of the theory, in the development of which Suppes has been one of 
the key workers. Pfanzagl (1968) is the first work devoted to the theory of 
measurement with a representational approach. Krantz et al. (1971) published 
a very detailed and thorough account of the representational foundations of 
measurement, though only part of their work has been published. At the time 
of preparation the most recent release on this subject is Roberts (1979). 

While the theory of measurement finds a place in most modern texts on 
quantitative psychology, sociology and the like, it receives very little, if any, 
attention in the literature of physical measurement and instrumentation. 
Surveys of the theory (Finkelstein, 1973, 1975a) have stimulated some awareness 
in the foundations of measurement among measurement and instrumentation 
engineers. They have begun to bridge the large epistemological gap currently 
existing between theorists and hardware designers’ understanding of measure- 
ment systems. 

1.3 THE NATURE AND PROPERTIES OF MEASUREMENT: 

AN INFORMAL DISCUSSION 

Before embarking on a formal definition and analysis of the measurement 
process it is first examined informally. The purpose is to highlight the principal 
aspects of its nature and properties without employing the rigorous but not 
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always familiar concepts and symbolism of the logical foundations of mathe- 
matics 

The outset of the discussion will be the informal definition of measurement 
presented at the beginning of the chapter 

‘Measurement is the process of empirical, objective, assignment of 
numbers to properties of objects or events of the real world in such a 
way as to describe them 

The definition will now be analysed and discussed 

Firstly, measurement is the assignment of numbers to properties of objects 
and events It is thus the description of properties of objects or events and not of 
the objects or events One measures the length of an object, the temperature 
of an object and so on Measurement presupposes the existence of a clear 
concept of a property as an abstract aspect of a whole class of objects of which 
individual instances or manifestations arc the subject of measurement 

The definition stales that the assignment of numbers m measurement is such 
that the numbers describe the property of the object or event The meaning can 
be explained as follows Consider that a number, or measure, is assigned by 
measurement to the property of an object, and other numbers are assigned by 
the same process to other manifestations of the property Then the numerical 
relations beiw een the numbers or measures, imply and are implied by empirical 
relations between the property manifestations Thus if the numbers, assigned to 
the manifestations of a particular property in two objects by measurement are 
equal, this implies that the two properly manifestations are empirically in- 
distinguishable Conversely empirical mdistinguishabihty implies the equality 
of measures Again if the numbers assigned by measurement to the manifesta- 
tions of a particular property, in a series of objects, can be placed m order of 
increasing magnitude, this implies that there is an empirical relation which 
would result in the placing of the objects in the same order in respect of the 
property Conversely, an empirical order among manifestations of the property, 
implies the same order among the measures This correspondence between the 
numencaf relation among measures and the empincaf refation of the corre- 
sponding property manifestations is all that is basically expressed by the more 
rigorous formal definition to be given later 

The above clearly indicates that measurement is a process of comparison of a 
manifestation of a property, with other manifestations of the same property 
This IS a common part of many informal definitions of measurement Many 
definitions however go further, to state that the measure of a property expresses 
the ratio of the magnitude of the property to a standard magnitude taken as 
unity As indicated m the introduction to this chapter, this begs the essential 
question of what measurement is The statement is untrue for many scales of 
measurement and would make the measurement of many properties impossible 
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There can be no ratio of two properties, only a ratio of their measures. The 
empirical relation corresponding to the ratio of two measures must be analysed, 
if it is to be meaningful. For many scales of measurement, measures do not 
correspond to a ratio to the unit magnitude. To take an obvious example, the 
temperature of a body on the Celsius scale is not the ratio of the temperature to 
a unit degree Celsius. As another example, intelligence of a person, for instance, 
cannot be measured as a ratio, to the intelligence of a person having unit 
intelligence. 

There is a divergence of views as to whether any descriptive assignment of 
numbers is adequate for the process to qualify as measurement. At one extreme, 
broadly in the social and behavioural sciences, there is the view that any 
empirical, objective assignment of numbers which describes a property mani- 
festation, can be termed measurement. At the other extreme 'is the view that 
only numbers which reflect in some way a ratio to a unit magnitude of a property 
are true measures. This is the classical view and that of most informal definitions 
of measurement in physics. Many other definitions imply that to be true mea- 
surement, the assignment of numbers must imply at least an empirical order 
among the property manifestations, corresponding to a concept of ordering 
according to magnitude. The author is an advocate of the first view (Finkelstein, 
1975a, b) though without wishing to impose the view on others. 

The matter will be considered later in the chapter in which it will be argued 
that the assignment of symbols other than numbers may essentially be very 
close to measurement. 

The next aspect of the definition of measurement which requires discussion 
is the fact that measurement is an objective process. By this is meant that the 
numbers assigned to a property by measurement must, within the limits of error, 
be independent of the observer. It is not uncommon for properties such as the 
suitability of candidates for a post to be valued by judges on a scale of 0-10. 
Numbers resulting from such a valuation cannot be considered measures, 
unless it is established that the same numbers would, within acceptable limits 
of error, result from any valuation process of the subject using the same pro- 
cedure. 

The informal definition of measurement proposed stresses the fact that 
measurement is an empirical process. This means first that it must be the result 
of observation and not, for example, of a thought experiment. Further, the 
concept of the property measured must be based on an empirical relation. For 
example, the use of the Universal Decimal System of library classification, 
results in the assignment to documents of numbers which describe their content. 
The assignment is essentially objective, for classification by trained observers 
would, with a low error rate, result in the same number being assigned to the 
same document. Let us accept, just for the sake of this argument, that assignment 
of numbers which describe class membership can be measurement. Even then 
the above library classification cannot be measurement. The reason is that the 
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classification of knowledge on which it is based is substantially an arbitrary 
convention. 

It IS convenient to discuss at this stage the properties of measurement which 
give it the key role m science. 

As indicated m the introduction to the chapter there is first and foremost 
the objectivity of measurement. A measure is an objective description and hence 
a proper scientific datum. Conversely, it can be claimed that if we can arrive at a 
totally objective description of a property manifestation, the most vital step 
towards measurement has been taken. 

The second property of measures is that they are descriptions of great con- 
ciseness. A single number tells os what ii would take many words to express. 

Measurement gives, further, a description which is precise, pinpointing by a 
single number a particular entity, where a verbal description indicates a range 
of similar but differing things. 

A measure of a property gives us an ability to express facts and conventions 
about it in the formal language of mathematics. Without the convenient notation 
of this language, the complex chains of induction and deduction by which we 
describe and explain the universe would be too cumbersome to express 

It follows from what has been said that description by numbers is not good in 
itself. The only value of measurement lies in the use to which the information is 
put. Science is not just the amassing of numerical data, it depends upon the way 
in which the data are analysed and organized. 

Finally, we return to the fact already mentioned that measurement enables 
the measurand to be expressed m signals which can be handled by machines 
This will be analysed later in the chapter. 


1.4 THE ELEMENTS OF THE FORMAL THEORY OF 
MEASUREMENT 

It IS now necessary to look at the formal representational theory of measurement. 
While the preceding general discussion is adequate for insight, a full presentation 
\Vrfc TiwdeiTi ffrnVosoiAiy Hieasaiement requires recoarse To Tigorucft 
symbolism. 

The brief presentation which follows is a development of accounts given in 
Pfanzagl (1968) and Krantz et al. (1971). 

A representational theory of measurement has four parts: 

(i) an empirical relational system corresponding to a quality; 

(u) a number relational system; 

(iii) a representation condition ; 

(iv) a uniqueness condition. 

These will now be considered. 
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(i) Quality as an empirical relational system 

Consider some quality (for example length, hardness, etc.) and let qi, ^ 2 ^ 

Qi, . . . represent individual manifestations of the quality so that we can define a 
set of all possible manifestations: 

n = {Wi,W2,...,W, (1.2) 

represent the class of all objects manifesting elements of Q. 

Consider further that there exists on Q a set ^ of empirical relations Ru 
R 2 ,...,Ri,...,R„ and let us denote 

( 1 . 3 ) 

Then the quality is represented by an empirical relational system 

^ = ( 1 . 4 ) 


(ii) Numerical relational system 

Let N represent a class of numbers and let 

( 1 . 5 ) 

be a set of relations defined on N so that: 

= ( 1 . 6 ) 

represents a numerical relational system. 

Commonly ./C is just the real number line. 


(iii) Representation condition 

The representation condition requires that measurement be the establishment 
of a correspondence between quality manifestations and numbers in such a 
way that the relations between the referent property manifestations imply and 
are implied by the relations between their images in the number set. 

Formally, measurement is defined as an objective empirical operation 

M:Q-*N ( 1 . 7 ) 

such that 2. = <Q, is mapped homomorphically into (onto) = <Ar, 
by M and F. F is a one-to-one mapping, with domain SI and range 

( 1 . 8 ) 

so that we can denote 


Pi = F(Ri);PieS>-,R,€^ ( 1 . 9 ) 

P is an n-ry relation if and only if it is the image under F of an n-ry relation. 
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By a homomorphic mapping we mean that for all RtS ^ and all P,€Si and 
Pi = PiR,\ 

Uqi, .g., (i lo) 

Measurement is a homomorphism because A/ is not one-to-one, it maps 
separate but indistinguishable property manifestations to the same number 
Then 

= ( 111 ) 

constitutes a scale of measurement for tii = M(g,) The image of q, m N under 
M IS called the measure of on scale iP 

(iv) Uniqueness condition 

The representation condition may be valid for more than one mapping M One 
may admit certain transformations from one scale of a property to another 
without invahdating the representation conditions The uniqueness condition 
defines the class of scale transformations to those for which the representation 
condition is valid 

(v) Uncertainty 

The above definition has been given in terms of deterministic relations and 
mappings However, all experimental observations are accompanied by error 
Uncertainty should be introduced into the representational theory of measure- 
ment Leaning and Finkelstein (1979) have presented such a theory based on 
the concepts of a probabilistic relational system and probabilistic homo 
morphism 


QUALITY CONCEPT FORMATION 

Measurement presupposes something to be measured Both in the historical 
development and logical structure of scientific knowledge, the formulation of 
a theoretical concept or construct, which defines a quality, precedes the develop- 
ment of measurement procedures and scales 
Thus the concept of ‘degree of hotness’ as a theoretical construct, interpreting 
the multitude of phenomena involving warmth, is necessary before one can 
conceive and construct a thermometer Hardness must, similarly, first be clearly 
defined as the resistance of solids to local deformation, before we seek to 
establish a scale for its measurement The search for measuring some such 
conceptual entity as ‘managerial efficiency’ must fail until the concept is clarified 
The formation of concept m empirical saence (Hempel, 1952) is an important 
and much discussed subject Here consideration will be confined to the concept 
of quahty 
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The basic notion is that of a manifestation of a quality, an abstract, single 
sensed aspect of an object or event, such as, for example, the smell of a substance. 
Observation of the real world leads to the identification of empirical relations 
among these single manifestations. Examples of such relations are similarity, 
difference and the like. As a result the concept of a quality is formed as an objec- 
tive rule for the classification of a collection of empirically observable aspects of 
objects into a single set, together with the family of objective empirical relations 
on that set. The resulting relational system is a quality and each single member 
of the set is termed a manifestation of the quality. This was formally set out in 
Section 1.4 where quality was defined as the set (2, of all manifestations q of the 
quality, Q = {q'}, together with the set of all relations R on Q. 

We can thus see that there is a difficulty in the measurement of such qualities 
as beauty. The existence and meaningful use of the word beauty indicates the 
usefulness of the concept. However, there is not ah objective rule for classifying 
some aspect of observable objects as manifestations of beauty. Similarly, there 
are no objective empirical relations such as indistinguishability or precedence, 
in respect of beauty. The basis for measurement of beauty is thus absent from 
the outset. 

When there exists a clearly defined quality, as a set into which its manifesta- 
tions can be objectively and empirically classified, together with a set of empirical 
relations, then we can always find some symbolical relational set by which it 
can be represented. Thus if a quality is definable as above, we can set up for it a 
scale of measurement, in its broadest sense. However, this is a relatively late 
state of development of the concept of quality. 

In some cases the concept of a quality arises from invariances in numerical 
laws arrived at by measurement. An obvious example is Young’s modulus. This 
quality is arrived at from Hooke’s law: the observation that for an extensive 
class of materials, strain is proportional to stress. In general, however, one 
starts from some direct concept of a quality and then seeks measurement scales. 

Usually given a concept of a quality and a scale of measurement based on that 
quality, the accumulation of data by measurement leads to the clarification and 
re-evaluation of the quality concept. This in turn leads to improvement in the 
measurement scale. The process is an ascending spiral. The history of develop- 
ment of thermometry from simple devices and concepts to the modern thermo- 
dynamic definition of temperature is an example. 

One of the principal problems of scientific method is to ensure that the scale 
of measurement established for a quality yields measures, which in all contexts 
describe the entity in a manner which corresponds to the underlying concept of 
the quality. For example, measures of intelligence must not disagree with our 
basic qualitative concept of intelligence. It is usual that once a scale of measure- 
ment is established for a quality, the concept of the quality is altered to coincide 
with the scale of measurement. The danger is that the adoption in science of a 
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well defined and restricted meaning for a quality like intelligence, may deprive 
us of useful insight which the common natural language use of the word gives 
us 

Finally, let us explain the concept of an empincal quantity If there is an order 
relation m the quality relational s^em, enabling us to order quality mani- 
festations in a way which has formal similarity to the relations equal, greater 
and less, then the quality is termed a quantity 


1.6 SOME EMPIRICAL RELATIONAL SYSTEMS AND DIRECT 
SCALES OF MEASUREMENT 

1.6.1 Object and Scope of Section 

In the present section an attempt will be made to analyse some qualities as 
empirical relational systems and to explain the logical basis of deriving a scale 
of measurement for them Extensile measurement, that is measurement of 
physical quantities for which we can construct an operation having the formal 
properties of addition, is the basis of physical measurement and will be con- 
sidered in detail Elegance and ngour will be eschewed in favour of as much 
clarity and simplicity as possible 


1.62 Extensile Measurement 

The extensne scales of phjsical measurement are based on establishing for the 
quality 2 of empincal objects, for which a scale is to be determined, of an 
empincal ordenng with respect to 2 of the class D of all objects possessing 
elements of Q, together with an operation « of combining the objects, elements 
of n, which has with respect to 2 the formal properties of addition Such scales 
are known as extensive The theory of the form of construction of such scales of 
measureixiflDf nnginates Xrom the work of Holder ) 

and IS luadly developed in the work of Campbell (1920) There has, however, 
been much work concerned with establishing elegant systems of axioms of 
representation (Pfanzagl 1968, Krantzer al, 1971) 

The abov e will now be stated more formally the basts of a scale of measure- 
ment of 2 IS the definition of the set Q 

Secondly, there must be an operational procedure which establishes on the 
set of objects Q possessing 2 an empirical equiv alence relation ~ and a transitiv e 
empirical relation -< w ith respect to 2 such that <Q, ~ is an order system 
Without discussing order in detail we shall use the simple definition of order 
s>stem given m Appendix 1 A 


TIII.ORV and PIIIl.OSOi’MY OP MDNSURnMKNT 


13 


Finally, consider objects »v, , us , u' 3 , u'^ 6 f2 exhibiting property manifestations 
</i • <7 j . <73 • For an extensive measurement scale there must be an operation 

of combining v.). u'j witli respect to r/j and Qi whicli we shall denote by gj <= r/i 
U’ith the Slime formal properties as addition. 

F'or all f/ e Q 


(i) r/'fy.eQ 

(ii) r/i ' r/, - qz 

(iii) r/i - r /2 ~ q. r/, 

(iv) f/i "■(‘72^<?3) ~ (^i ■''< 72 ) “<73 

(v) if qz ~ <73 tlicn r/, ^ <?2 ~ <?i <73 

if </3 ~ (/2 then <?i o <73 ~ < 7 i ° <72 

(vi) if </,. r/,, r/j bear to each 

relation - and qj<q\, then 
number ii such that q\ -< r/, qz 



( 1 - 12 ) 


(1.13) 

commutativity 

(1.14) 

associativity 

(1.15) 


(1.16) 


(1.17) 

other the Archimedean 


there is a postulate 


••• qn 

(1.18) 


Wifi) these definitions the empirical relation system (Q, hasa structure 

of the same properties as the numerical relation system <Rc, =, <, + >. 

With an empirical ordering operation and an 'additive combination’ thus 
csttiblishcd one proceeds to the setting up of a scale. 

A single object with .s, 6 Q is chosen as standard and assigned the number 1, 
that is, it is chosen as the unit of the scale. One then constructs or seeks another 
object with .Vj e Q such that s\ ~ .Sj. One can then construct a standard Sz ~ 
.V, .v'l and assign it the number 2, S 3 ~ Sz ° .Sj and assign it the number 3, and 
so on. Fractional standards can be generated by constructing S]/ 2 , s\,z e Q, 
.''i< 2 '' '' 1/2 ~ •’’1 assign s^/z the number 4. Thus we generate 

S = {...,.S-,,3,Sj,52,S3,...} (1.19) 


Any q eQ is then measured by finding the element s, to which it bears the 
relation and assigning to it the number corresponding to s,-. 

.As an example, in the measurement of mass the equipoise balance offers the 
means of establishing empirical order. If the arm balances, the masses in the 
two pans arc equivalent. The tipping of the balance indicates that one mass is 
’heavier’ than the other. Thus we can rank a series of weights in order of heavi- 
ness. The lumping together of two objects is with respect to mass an operation 
with the properties of addition. 

A variety of problems with respect to the form of measurement discussed 
above are examined in the literature, such as various forms of necessary and 
sufficient representation conditions, the possibility of constructing scale based 
on operations resulting in alternative numerical representation such as for 
instance, multiplicative sailes based in a homomorphism into <Rc. = , <, > and 
conceptual difiiculties imposed by the limitations in practice of the size of 
standard. Readers arc referred to Krantz et al. (1971). 
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1 .6 J Matching Scale 

A matching scale is based on the establishment on the set of quality manifesta- 
tions Q of an empirical indifference relation ~ 

Given <Q, -^>,3 set of differing elements s,eQ (s, ~ 'Sj if / are selected 
to form a standard set 5 = {s„ Sj , , s^} Numbers (or other symbols) «, e N 

are then assigned to each s, e S, the same number n, not being assigned to two 
differing elements S( € S (if s, ~ Sj, nj # itj) The fundamental measurement 
operation M of the scale consists of an empirical operation in which measurands 
6 Q are compared with members of the standard S If ~ s, it is assigned the 
number n, 

An example of this form of scale is acolour code in which the relation ‘matches’ 
constitutes the empirical indifference relation 
In the establishment of a colour code we first establish the indistinguishability 
relation based on a colour match This indistinguishability relation can be 
tested for symmetry and transitivity Reflexivity is implicit The relation is 
therefore an equivalence relation We then select as standards a set of coloured 
objects, each a distinct colour manifestation Each is assigned a different number 
or other symbol such as a word label Any unknown colour manifestation is 
compared with the standards and if ti matches one of them it is assigned the 
same number or symbol as the standard 
Matching scales are not generally considered measurement scales, since they 
are not quantitative in the sense that they do not establish on Q relations 
formally similar to the relations greater or less Also it is not generally practical 
to establish sufficient standard elements to ensure that every qisQ can be 
matched with a standard and hence assigned a measure 


1 .6.4 Ranking Scales 

In ranking scales, an empirical order system <g, ~, ■<> is established on Q A 
set of differing standard objects having 5| 6 6 ts then selected and arranged in 
an ordered standard senes S = {s,, , s,} according to (Q, -<> Numerals 

are assigned to each s, say i in such a way that the order of numerals corresponds 
to the order in S of standards to which they are assigned Any qeQ can then be 
compared with the elements of 5 in the same way as in nominal measurement 
Ifg tears the relation ~ to any s,e 5 it is assigned the numeral of s, If an entity 
IS not equivalent to any s^e S one can determine between which two standard 
elements it lies in the empirical order system 

The test example of a ranking scale of measurement is the Mohs scale of 
hardness of minerals 

Ten standard minerals are arranged in an ordered sequence so that precedent 
ones in the sequence can be scratched by succeeding ones and cannot scratch 
them The standards are assigned numters 1 to 10 (The sequence is talc 1, 
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gypsum 2, calcite 3, fluorite 4, apatite 5, orthoclase 6, quartz 7, topaz 8, corundum 
9, diamond 10.) A mineral sample of unknown hardness which cannot be 
scratched by quartz and cannot scratch it, is assigned measure 7. 

1.7 INDIRECT MEASUREMENT 

The preceding section considered measurement scales formed by direct mapping 
from a quality relational system to a numerical relational system. Frequently, 
however, scales of measurement for qualities are constructed indirectly through 
a relation of the quality to be measured and other qualities, for which measure- 
ment scales have been defined. The reason may be, for example, the impossibility 
of setting up a satisfactory measurement scale directly. Thus for example we 
cannot set up an extensive measurement scale for viscosity or density, since 
there is no appropriate combination operation having the properties of addition. 
Another reason is the wish to set up a consistent set of measurement scales, in 
which a minimal set of qualities with direct scales is defined, and scales for other 
qualities are defined by derivation from them. 

In its simplest form consider a case when every object that manifests the 
quality to be measured exhibits a set of other qualities which are measurable. 
Then to each a manifestation of the measurand quality, there corresponds a set 
of measures of the associated qualities. These associated or component measures 
can be arranged in an ordered array. If manifestations of the measurand quality, 
have identical arrays of component measures, if and only if they are indistin- 
guishable, then the array of component measures characterises the measurand. 
This will now be expressed formally. 

Consider a quality .So, for which it is desired to construct a scale of measure- 
ment. Mq consists of the set Q of all the manifestations of the quality and of SIq 
a set of relations among the manifestations. Consider now the class of all objects 
f2 which exhibit manifestations of the quality .2. Let each element of Q. also 
exhibit logically independent qualities {Jj, Jj, - • • , ^n}- 

Let us assume that there exists for each element of the above scale of measure- 
ment such as 

yT,., Mi, Fi> (1.20) 

Assume that to each object w 6 Q there corresponds one and only one 
% e Go snd one and only one element: 

q = <gi,...,gi,...,g„> (1.21) 

etc.) of the product set Xf=i6>- That element corresponds to an 
ordered array of measures; 

M(go)= <Mj(g, ),..., M„(g„)> 
where M, ■((?,) is the image of e Qi in Ni under Mj and so on. 


( 1 . 22 ) 
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If now for any go- (Jo e Go. 9o ~ ^o. M(^o) = where ~ represents 

empirical indistinguishability, then we can say that M(qo) characterizes go 
If we can combine the various component measures or in other words map 
them into a single number so that numbers assigned to the quality manifesta- 
tions by this process, imply and are implied by empirical relations between the 
quality manifestations, then this sets up an indirect scale of measurement This 
will now be explained formally, referring to the discussions above 
Let there exist a mapping 0 from the product set X"=i^i ^ number set 
No 

no = 0CM,(g,). , M„(g„)) (123) 

isthentheimageofMi(gi), , , M^g,) m N q under 0 Wecanthen 

see that the composition of (M,, , M,, , M,) and the correspondence 

between elements of Qa and X? i Q, constitute a mapping from Qo to A^o. which 
we shall denote A/q 

Consider that there is a set of relations on Nq giving a relational system 
^ ss (^No, ^o) *hat there exists a correspondence Fq 0to 0*^ Then if 
Afo> Fo map SLq homomorphically into (onto) then 

= .5^-. .Afo.fo) (124) 

constitutes an indirect measurement scale for .So Typically .So = <Goi 
that is we have an empirical order system on our measurand quality The 
mappings A/q, Fq are such that whenever the objects are ordered according to 
= , <> they are also ordered according to <No, =, <> 

Consider as an example the scale of measurement of density of homogeneous 
bodies Each such body possesses mass, say m, and volume u (where m and i> are 
assumed to be measures on already defined scales) It is an empirically established 
law that objects of the same material, and hence conceptually of the same 
density, have the same ratio (m/v) When different materials are ordered accord- 
ing to our concept of density they are also ordered according to the respective 
ratio m/v Hence a scale of measurement of density is based on the ratio of mass 
to volume 

A few observations should be made here The mapping 4> is not unique in its 
order preserving properties with respect to density For instance, (m/u)^ would 
be an equally valid derived measure of density The form of 0 is chosen to result 
in the greatest simplicity of mathematical relations involving density The 
properties of the function 0 are an idealization of real observations 
In general once an indirect scale of measurement has been defined it is treated 
as a definition of % 

The description of qualities by multidimensional arrays of measures, where 
the measures are not combined, is known as multidimensional measurement 
An example might be a characteruation of a shape by a set of geometrical 
measures 
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Measurement by combining component measures, so that the resultant 
number characterizes the place of the measurand quality manifestation in an 
empirical order is termed conjoint measurement. It is based on the conjoint 
establishment of order on the measurand quality and on the component 
qualities. 

In physical measurement indirect measures of qualities are obtained as 
multiplicative monomial functions of the measures of component qualities. 
This is generally known as derived measurement (Krantz et al, 1971), though 
see a more general statement term of the term derived in Finkelstein (1975a). 

1.8 UNIQUENESS: SCALE TYPES AND MEANINGFULNESS 

As stated in Section 1.4, the requirement that the fundamental measurement 
procedure of a scale should map the empirical relational system .2 homo- 
morphically into the numerical relational system Jf does not determine the 
mapping uniquely. 

Thus there is an element of arbitrary choice in the setting up of scales of 
measurement. In the case of scales based on additive combination, for instance, 
the choice of the unit standard is arbitrary. 

In the case of a ranking scale of measurement the actual numbers assigned to 
the standards are arbitrary, subject only to the requirement that they should be 
in the required order. 

The requirement of homomorphism thus defines a class of scales which may be 
called equivalent. The class of transformations which transform one member 
of a class of equivalent scales into another is called the class of admissible 
transformations. The conditions which admissible transformations must satisfy 
are known as the uniqueness conditions. They specify that a scale is unique up to 
a specified transformation. 

Consider as an example a mapping which maps homomorphically the 
empirical relational system <2, o> into the numerical relational system 

(Re, =, <, + >, that is, it is the fundamental measurement procedure of a scale 
based on additive combination. If all measures M( ) be replaced by aM( ) 
where a is a real positive number, then the measures aM( ) preserve the re- 
quired homomorphism and multiplication by a real positive number is an 
admissible transformation. In effect there has been a change of unit to 1/a on 
the original scale. 

We can classify scales by the classes of transformations admissible for them 
(Stevens, 1959). Let m be numbers representing measures on a scale and let m' 
be corresponding numbers on the transformed scale. The generally accepted 
classification of scales is given in Table 1.1. 

The problem of the meaningfulness of statements made about a quality in 
terms of its measures is important. Such a statement is meaningful if its truth 
is unchanged by admissible transformations of the scales of measurement, in 
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Table 1 1 Classification of soles of measurements 


Class of admissible transformations Scale type 


M = f{M) where Fi 
M = F{Nf) where F{ 

_ f ctM + ^ a > 
~)aM a > 


M ={ 


) IS any one to one substitution 
) IS any monotonic increasing function 
0 
0 


nominal 

ordinal 

interval 

ratio 


other words, if it reflects the empirical relational system on which the scale is 
based and not just the arbitrary conventions of the scale 
We say that a fc-ry relation P is meaningful if 

P(M(q,X = . M'(gJ) (125) 

where M -* M is any admissible transformation 
Thus as a very simple example, it is meaningful to speak of the ratio of two 
masses, since that ratio is invariant with resfiect to changes of the unit of mass, 
but It IS not meaningful to speak of the ratio of two hardnesses measured on the 
Mohs scale, since that ratio would be changed by a monotonic increasing 
transformation of the scale 

Where P{ ) is an independently empirically established relation among 
objects, expressed in terms of measures M( ) it is still meaningful, even if it is 
not invariant with respect to admissible transformations of the scale of measure 
ment 

Another view of meaningfulness which can be taken is that only such state- 
ments involving measures are meaningful which can be logically traced to the 
empirical operations on which the measurement is founded 
One aspect of the meaningfulness of interpretations of measurement data is 
the description by statistics Table 1 2 presents statistical measures meaningful 

Table 1 2 Statistical measures appropriate to measurements made on various classes 
of scale 


Measures 


Scale type 

Location 

Dispersion 

Association 

Significance 

nominal 

mode 

mfonnatiOQ 

transmitted 

mformation 

chi-squared 

test 

ordinal 

median 

percentiles 

rank order 
correlation 

sign test 

interval 

mean 

standard, deviation 
average deviation 

product mean 
correlation 

t test 
f-test 

ratio 

geometric and 
harmonic mean 

% variation 
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for measurements on various classes of scale. Statistical measures in a row of 
the table corresponding to a particular scale are also meaningful for the scale 
listed below, but are meaningless for those listed above. 


1,9 MEASUREMENT AND OTHER FORMS OF SYMBOLIC 
REPRESENTATION 

Measurement is only one form of representation of entities by symbols. It is 
closely related to other forms of symbolization. 

At its simplest level a symbol may be only a name or label which can be used 
to refer to the object and handle information about it. The number of an item 
of equipment on an inventory list is such a symbol. The inventory number can 
be used to handle information about the equipment in a compact and convenient 
manner without the equipment itself being handled. 

The type number or class name of a class of equipment is a symbol which 
describes the equipment, insofar as it denotes its similarity in certain respects 
to other equipment bearing the same symbol. Thus the class label ‘Concorde’ 
referring to an aircraft describes insofar as it denotes its similarity to other 
aircraft bearing the label ‘Concorde’ and its difference from aircraft described 
by other class names. The statement ‘John is flying to New York by Concorde’ 
gives information to the recipient of the message in a compact form, without 
him or her having to see the aircraft. 

Library classification is another example to be considered (see Section 1.2). 
Taking an imaginary classification system for the contents of books and docu- 
ments we may have a set of rules by which a combination of letters and numbers 
is assigned to describe the contents of an item. Two items with the same number 
say FA 592 have equivalent content in the sense that they deal with the same 
class of subject matter. The designation FA 592 however may give further 
information. Depending on the rules of classification and assignment of class 
symbols, it may be possible to determine the relation of items so designated, to 
those denoted, for example, by FA 592 or FB and so on. 

Properties of objects can be described by the symbols of natural language. 
For example the words ‘red’, ‘blue’, and ‘green’ are closely similar to labels on a 
nominal scale of colour. The sequence ‘hot-warm-tepid-cool-icy’ has some 
similarities to an ordinal scale of temperature (Pfanzagl, 1968). 

The examples given are not in any way exhaustive, nor have they been 
rigorously analysed. They show that there is a large number of practical cases 
in which objects of the real world and their attributes and characteristics are 
represented not by numbers but by conventional symbols. This does not con- 
stitute measurement, but shares with measurement its most essential charac- 
teristics: namely, the representing symbol designates the entity represented, 
relations between correspond to relations between represented entities, and 
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symbols can be used to achieve responses corresponding to the entities they 
represent 

The formal representational theory of measurement based on these definitions 
given m Section 1 4 can be extended to the more general case of representation 
by symbol systems 

A symbol will be defined here as an object or event, which has a defined 
relation to some entity, for the purpose of eliciting a response appropriate to 
that entity in its absence 

Let Q be some set of entities and let R be some set of relations on Q con- 
stituting a relational system 

S=<Q,^'> (126) 

Note that Q may now be a set of objects events, abstract entities, etc and 
need not be empirical Now let 2 be a set of objects or events to be used as 
symbols and let ^ be a set of relations defined or existing on Z to constitute a 
symbol relational system 

^=<Z.^> (127) 

With F a mapping from ^ onto #as in the case of measurement, we can again 
define M as a mapping from Q into (onto) Z such that M and F map 2 homo 
morphically into (onto) ^ 

Then 


jT Af. f > (128) 

Let 2 t = A/(^j) be the image of q, in Z under M Then z, is termed the symbol 
of (or for) q, under and q, is termed a meaning or referent of r, under is 
termed the symbolism or code 

If we are given z, and the code^ we have information about any qc^ot- which 
r, IS a symbol and about relation between q, and other members of Q 
All the considerations of the formal theory of measurement can be generalized 
to the symbolization of any relational systems by any general symbol system 
k^rraj^ieexarapfie w ffi'oe giv enio ifiustratefnat app’iication ol representationa’i 
theory to the representation by symbols Consider a set of aircraft denoted by 
Q Consider further that we may dnnde the aircraft into three types partitioning 
the set into three subsets There is thus an equivalence relation which relates 
two aircraft which are members of Q and are of the same type Then we may 
denote this division of aircraft into types as a relational system ^ = <Q, ~> 
Let us now choose three symbols A, B, C, one corresponding to each of the three 
aircraft classes, giving us the symbol set Z = {>4, B, C} There is on the symbol 
set an identity relation =, which relates any two identical symbols of Z We 
thus hav e a symbol relational system The class labelling is a mapping M which 
assigns to any aircraft m Q the symbol corresponding to its class name 
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F is the correspondence of ~ and s. 

If we are given the symbol for an aircraft, say A, we know that it is of the same 
type as any other aircraft with the symbol A. This is the principle on which the 
type name ‘Concorde’, discussed previously, is assigned to an aircraft. 

Of course, as has already been mentioned, natural language words are used 
to represent and describe objects and events of the real world as well as their 
properties. Indeed they are the most generally used method for such symbolic 
representation or description. 

The above representational theory may be used for the analysis of such 
descriptions. However the comparison of colour description in natural language 
with colour labels on a colour matching scale serves to illustrate some of the 
problems. Take the symbols ‘muddy brown’ and ‘A33’ which may be given to 
the same colour using natural language and a colour chart respectively. In 
addition to the subjectivity and vagueness of the linguistic description as 
compared with the colour chart label, we note that the expression ‘muddy 
brown’ communicates more than a denotation of colour. There is firstly the 
connotative meaning communicated by association with the properties of mud : 
dirtiness, stickiness and so on. Further there is an affective component of 
meaning. Something is communicated by the message originator concerning 
his or her unfavourable feelings about the colour. (See Leech (1974) for a good 
introduction to the descriptive properties of natural language.) 

To conclude this discussion of representational systems it has been argued 
that the theory of the foundations of measurement can be extended to embrace 
forms of representation of entities by symbol systems other than numerical ones. 
The basic concepts of the foundational theory of measurement, such as repre- 
sentation, uniqueness, meaningfulness and like, can be extended to general 
symbolic systems (Finkelstein, 1975b). 

Non-numerical representations are like measures in that they describe the 
entity represented and some of its relations in a form which enables information 
about the entity to be conveniently manipulated. 

The most important special feature of measurement is that the assignment of 
numbers in measurement is objective and represents empirical facts. 


1.10 MEASUREMENT, INFORMATION, AND INFORMATION 

MACHINES 


1.10.1 Information 

The concept of information will now be explained. The essence of information 
is symbolic representation. The elements of some set of objects or events, which 
we term symbols, carry information about the elements of some other set of 
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entities, the set of referents, if a deOned relation, preferably one to-one, exists 
between the elements of the two sets More generally there may be relations 
among the symbols so as to constitute a symbolic relational system and there 
may be relations among the referents so as to constitute a referent relational 
system The symbols carry information about the referents, and the relations 
thej bear to each other, if there is a defined mapping, preferably isomorphic, of 
the referent relational system into or onto the symbolic relational system 
Information consists of the symbol together with the relation it bears to the 
referent 

Using the notation introduced before, if we have a referent relational system 
SL = <Q. &') a symbolic relational system Z = <Z, and mapping Af Q -» Z 
-.and F & & giving us a code — <.2, Z, M, F>, then 

J = <'i?, 2,> {1 29) 

represents information about a q, for which z, is a symbol and about relations 
between q, and other members of Q 

Measurement is a special case of representation by symbols m which a 
measure Uj, together with the scale of measurement represents information 
/ as n,} about the measurand q, 

The use of the term ‘information’ as given here differs from the usage of the 
term m information theory There arc, however, essential similarities 
In information theory we consider an information transmission channel which 
transforms elements x, of a set of inputs X into elements of a set of outputs Y 
This transformation is m general many-to*many 
The quantity of information provided by the occurrence of an output y about 
the occurrence of an input x* is defined as 


log 


PjxM 

P(xt) 


(130) 


The base of the logarithm defines the unit of the scale 
If we consider Q as analogous to X, S as analogous to Y, and M as analogous 
to the X to y transformation of the communication channel, we see that the 
underlying concepts of ‘information’ in this chapter and m information theory 
are fundamentally similar In both information is knowledge about and entity 
provided by an image of the entity under a mapping 
Information theory is concerned with measures of ‘quantity of information’ 
for the special problem of transmission through a channel which transforms an 
input to an output It measures the quantity of information transmitted through 
a channel by the change from a priori knowledge to a posteriori knowledge 
about the channel input provided by the channel output This measure conforms 
to the concept of information as defined m this chapter, though it is designed for 
a restricted problem class only 
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1.10,2 Information Machine 

Having defined information we shall consider a class of machines which acquire, 
process, present and give effect to information. This class includes instruments 
for measurement, computation, communication, and control. 

In the functioning of information machines, the information-carrying symbol 
is cither the instantaneous value of a physical quantity, such as the angle of 
deflection of a pressure gauge pointer, or some parameter of the variation of a 
physical quantity with time, such as the instantaneous frequency of a variable- 
frequency alternating voltage in a frequency modulation system. 

Tlic pointer deflection angle together with the gauge calibration law carries 
information about the input pressure: it is the information-carrying symbol. 

The physical quantity, the magnitude or time variation of which carries 
information, is termed the signal. Thus in the above examples the signals are the 
pointer deflection and the voltage. 

A machine may be defined in general terms as a contrivance which transforms 
a physical input into a physical output for some definite purpose. Thus, for 
instance, a lever constitutes a simple mechanical machine transforming input 
power into mechanical output with the object of achieving mechanical 
advantage. 

An information machine functions by performing a prescribed transformation 
ofan input physical signal into an output physical signal. Thus, the instantaneous 
magnitude or some parameter of the time variation of the output signal is 
related through the instrument transformation to input signal magnitude or 
some time variation parameter. The relevant feature of the output signal, 
together with the instrument transformation law, carries information about 
some referent feature of the input signal, as has been shown in the case of the 
pressure gauge above. 

Tlie requirement to maintain prescribed functional relations between input 
and output distinguishes information machines from other forms of machine, 
tlic purpose of which is to generate and transform power, and determines the 
principles of analysis and design of information machines. 

The distinctive features of a class of machines which maintain specified 
functional relationships between inputs and outputs were recognized in par- 
ticular by Draper et al. (1952) and Kuhlenkamp 0971). 

Tlic nature of instruments as information machines determines the approaches 
to their design, description, and application. This forms the basis of a systematic 
instrument science (Finkclstcin, 1977). 

1.11 MEASUREMENT THEORY IN THE PHYSICAL, SOCIAL, 
AND BEHAVIOURAL SCIENCES 

Tlie present section is intended to discuss the state of application of measure- 
ment theory in the physical, social, and behavioural sciences. It is not intended 
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to consider specific techniques and methodologies of measurement, a subject 
obviously too wide m scope 

1.11.1 Measurement Theory in the Physical Sciences 

The classical theory of measuiemenl {HelmhoUz, 1887, Campbell, 1920) was 
developed to give an account of measurement in the physical sciences 

In terms of this classical theory, measurement m the physical sciences is based 
on establishment of direct extensive scales of measurement for a number of 
physical quantities as described in Section 161 These quantities are used as 
the base of a system Scales for other physical quantities are obtained as derived 
scales that is indirect scales m terms of the base quantities, in the form of multi- 
plicative monomial functions of the base quantities 

This account is not totally satisfactory even for classical physics For example, 
the SI system of units (NPL, 1973) has seven base quantities length, mass, time, 
electric current, thermodynamic temperature, amount of substance, and 
luminous intensity The base unit of current for example is defined as a current 
which if maintained in two parallel conductors of infinite length, negligible 
circular cross section, and placed a specified distance apart, produces a specified 
force per unit length This, then, involves firstly scales of measurement of length 
and force Further it involves a theory of electromechanical interaction, that is 
a law of force between conductors this is essential to enable us to calculate the 
instrument lawofacurrent balance by which the unit of current must be realized 
The actual realization of the definition in terms of infinitely long infinitesimal 
conductors is not possible Even for quantities such as length, for which an 
extensive scale can be established using concatenation of standard lengths, the 
unit of length is in fact defined in terms of the wavelength of light, which is not 
a material object but a construct of physical theory 

The analysis could be taken further, but the examples chosen give an under- 
standing of the deficiencies of the classical theory of measurement as an account 
of the foundations of physical measurement The situation of physics is that it 
consists of a number of axiomatized theories such as Euclidean geometry, 
cfiassical mednames, thermodynamics, electromagnetism, and so on The scales 
of measurement of classical physics are based on the acceptance of these 
theories as representations of the real world and defining the units on that basis 
rather than on the individual axiomatization and establishment of scales of 
particular physical quantities A formal theory of measurement in the physical 
sciences based on this view awaits development Collection of bibliographic 
material, as a start m this direction, is in progress at at least one institution (Van 
Brakel, 1977) 

Special problems arise in quantum aud relativistic physics which cannot be 
handled by the theory of measurement presented above In quantum physics 
the interaction between observer and observed system imposes a limit on the 
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certainty of the joint measurement of the attributes of the system such as 
position and momentum. This imposes a fundamental difficulty for measurement 
theory. The theory of relativity has a large impact on the theory of measurement 
in the physical sciences. In terms of the view presented above it attempts to 
represent reality using a theory different from those used in classical measure- 
ment, for example classical mechanics. The rejection of the concept of simul- 
taneity and an upper limit to velocity are examples of such differences. There 
are thus fundamental developments of application of measurement theory to 
the physical sciences to be undertaken. 


1.11.2 Measurement Theory in the Social and Behavioural Sciences 

A complete analysis of the theory of measurement in the social and behavioural 
sciences would require a detailed discussion of their nature, content and 
methodology. The subject would be too extensive to tackle in this chapter. 
However, some of the key problems of the theory and philosophy of measure- 
ment in these sciences can be simply summarized. 

Firstly one should justify the discussion under one heading of these very 
different classes of science. The reason is that key problems of application of the 
theory of measurement in the social and behavioural sciences are essentially 
the same. 

Firstly, as has already been stated, it is the requirements of the social and 
behavioural sciences which have led to the replacement of the restrictive 
classical theory of measurement by the more comprehensive modern represen- 
tational theory. 

The social and behavioural sciences are concerned very much with such 
attributes or qualities as utility, standard of living, alienation, intelligence, and 
the like. Tlie first problem in attempting to measure them is the difficulty of 
establishing an adequate objective concept of these qualities based on empirical 
operations. The conceptual framework is often absent. 

When a scale of measurement for a quality such as utility or standard of living 
is established, there remains a fundamental problem of establishing that the 
measure and concept correspond. For example, index figures that are often 
prepared for the purpose of measuring standard of living are disputed by those 
whom they do not suit, as not measuring what they feel standard of living 
means. 

The empirical operations involved in establishing scales of measurement in 
the social and behavioural judgement, commonly involve responses by human 
observers. These are, for example, required to judge whether two stimuli, such 
as light intensities, pitch of sound, etc., are undistinguishable. As another 
instance they are used to give an ordinal rating to a number of alternatives. 
Although the data thus derived may be sufficiently consistent for a population 
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of observers to consider their objective, they are nevertheless subject to con- 
siderable statistical scatter 

The scales of measurement, the social and behavioural sciences bemg fre- 
quently based on determination of equivalence and order only, are then only 
monomial or ordinal 

Conjoint and multidimensional scales of measurement are commonly used 
They express some conceptual quality such as the ‘ahenation’ of a work force 
m terms of measurable quantities, such as worker days lost through disputes 
and absenteeism The difficulty again is the establishment of agreement between 
the concept of the quahty and the measures adopted 
There are no wholly axiomalized iheones in the social and behavioural 
sciences, which correspond to, say, classical mechanics or thermodynamics 
In conclusion it may be stated that m these sciences it is by no means uni- 
versally agreed that the clear formation of concepts in terras of empincal ob- 
servation, IS possible or desirable Nor is there agreement that the search for 
data, through measurement, advances knowledge and understanding The 
opponents of quantification would say that human nature and behaviour are 
too variable to enable the methodologies of the physical sciences to be applicable 
to them 


1.12 CONCLUSIONS, TRENDS, AND DEVELOPMENTS 

The conclusions, trends and developments are best outlined succinctly 
The interest and utility of fundamental measurement theory has been argued 
at the outset of the chapter Judgement must now be left to the reader 
There are a number of trends and developments in the field, as well as many 
unresolved problems These have been mentioned before, and they may now 
be usefully listed 

(a) There is a need for a further development of the treatment of uncertainty 
in fundamental measurement theory 

(b) The exploration of the relation between measurement and other forms of 
symbohe representation, such as descriptive natural language, is a poten- 
tially fruitful field of research 

(c) A theory of measurement based on axiomatized theories, rather than 
representation conditions for measurement scales of individual quantities, 
should be developed for the physical sciences 

(d) For the social and behavioural sciences it is right to pursue the Gahlei 
programme to attempt to measure that which is measurable and to render 
measurable that which is not This endeavour w^Il answer doubts about the 
feasibility and usefulness of measurement m these domains in one way or 
another 
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APPENDIX iJk: BASIC NOTIONS OF SET THEORY 
LA I Set, Subset, Fanulj of sets 

A set (or class) is a collection of elements For example the set of all forces 
Sets are denoted by A, B, , Q, 

Elements of sets are denoted by a, 6, , q. 

If /I IS a set and if a is a member of the set A we denote it by a 6 ^ 

A set A may be specified by the listing of its elements A = {a,, Oz, , a,}, or 
by specifying a property to be possessed by its elements, A = {A\P(a)}, the set 
of all a for which P(a) is true 

B is said to be a subset of A, if every element of B is an element of A we write 
B c. A 

A = B\I and only B <z A and A Z B 

An empty set contains no elements and is denoted by 0 

All sets are considered to be subsets ofa larger non>empiy set called the universal 

set U 

A set of sets is called a class of sets and is denoted by s/, 

Set Operations 

The union of two sets ^ and B,Akj Bis these! ofeleraents belonging to at least 
-4 or B The union of n sets A^, ,A, is denoted by 

The intersection of two sets A and B,Ar\Bi% the set of elements common to 
both A and B U A n B = 0 they are called disjoint 

The difference of two sets i4 and B, /I — B, is the set of elements of /I which are 
not elements of B In particular 1/ — /I is called the complement of A denoted 
hy A 

The Cartesian product of two sets A and B, denoted hy A x Bn the set whose 
elements of all ordered pairs (a, b) for which ae A,beB This can be extended 
to n sets Ai X A 2 X x A„ the set of all elements, n tuples (fli, 02 * » 

Qi eAi, etc, and denoted by lA, 

1,A3 Relations 

Relations and their properties will be defined m terms of sets 
ArelationRbetween elements ofsetSv4andBisasubsetofyl x B,R cAxB 
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This is a binary relation. An n-ry relation is c /Ij x • • • x We shall also 
denote an «-tuple which is an element of R by R{ai, a„). 

Let us denote R <= A x A and aeA,beA.Ris called a relation on A. 

The set of all a for which (a, b)e R is termed the domain of R, while the set of 
all b for whieh (a, b)sR is the range of R. 

The set of all elements o[ A x B which are not elements of R, is denoted by R' 
and defines the complementary relation of R. 

The set of all (b, a) for which (a, b) e R is the inverse relation of R, denoted by 
R-K 

A relational system (A, is a family of sets consisting of a set and a class of 
relations ^ defined on it. ^ is called the structure of the system. If y4 is a set of 
extramathematical entities and ^ are empirical relations defined on it the system 
is called an empirical relational system. If ^4 is a set of numbers and ^ are rela- 
tions on them the system is termed a numerical relational system. 


1.A.4 Special Types of Relations 

R is called symmetric if and only if whenever (a, b) e R, {b, a) e R. 

R is called reflexive if and only if for each a e A, {a, a) e R. 

R is called transitive if and only if whenever (a, b) e R and (b, c)e R then 
(a, c) s R. 

A relation R is called connected in A if for all a,b e A,a ^ b, either (a, b) e R 
or (b, a) e R. 


1.A.5 Equivalence, Order 

A relation R which is reflexive, symmetric, and transitive is called an equivalence 
relation. It will be denoted by ~. 

An equivalence relation ~, is a congruence relation for a relational system 
</}, if whenever a ~ b, a can be substituted for b in any Re3i. This is called 
the substitution property of the relation. 

Equality, =, is an equivalence relation between mathematieal entities, which 
has the substitution property with respect to all relational systems for those 
entities. 

A relational system <.4, ~, <> will be called an order system if and only if: 

(i) a,be A exactly one of the relations holds a b,a < b,b < a; 

(ii) ~ is an equivalence relation; 

(iii) < is transitive. 

An example of an order system is the numerical system <Re, =, <> for real 
numbers. 
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1>A6 Mapping 

A mapping of A into B denoted M /I -»■ B ts a rule, which assigns to each 
element Q e /4 an element of B, called the imAS® of a under M This element is 
denoted by Af(a) 

A is the domain of M and the set of images of the elements of A under M is the 
range of A/ 

If the whole of B is the range of Af the mapping is called onto M 

If different elements of A ha\e the same imagtf m B the mapping is called many- 

toone 

If each different element of A has a different uttag« the mapping is called one to 
one 

1.A.7 Homomorphism 

Gnen two relational systems </t, and <B. where A and B are sets and 

5? and ^ are classes of relations on >4 and B respectively, ^ = {R,. ,R,}and 

^ = {P, P„} consider that there is a mapping Af from A onto (into) B Af 

A-* B Then Af isahomomorphismfrom</4,^> onto (into) <B,^> if and only 
if for alia,, a^eA 

R/iOi* , It*) P.<A/(<J,), , A/ffli)) for I = I, ,n [o means implies and is 

implied by) 

I AJl Operations 

Let A denote any set Then a binary operation » on /I is a mapping which assigns 
to each pair of elements a, be A an element a » h 6 /I 
An operation is commutatiie if 

a o b = b o d 

An operation is associatiie if 

a o (h 0 c) = (a o h) o c 
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P. H. SYDENHAM 


Measurements, Models, and Systems 


Editorial introduction 

The physically existing world consists of numerous systems each having specific uniquely 
set parameters. Man’s models of these systems aim to be as general as possible so that they 
can be widely applicable. Thus mathematical and linguistic models require information 
that restricts them to the specific when they are applied to physically existing problems. 
This information is obtained by measurement. 

This chapter introduces the reader to the relationships existing between measurements, 
systems, and models of systems. It is intended to provide some understanding of measure- 
ment situations in order that an appropriate measurement strategy can be employed or the 
reasons for difficulties encountered might be explained. 


2.1 INTRODUCTION 

There has been, in the past, a general tendency for designers and users of measur- 
ing equipment to view a measurement within too limited a perspective. Efficient 
application and design of hardware requires of the user a broad understanding 
of the role that a piece of measurement hardware plays in the total system in 
which it is placed. 

Current understanding of the measurement process itself, having been 
summarized in the previous chapter, now enables an introduction to the 
relationship between measurements, systems, and models of systems. 

As measurements help us to produce models of real situations and as under- 
standing of systems is largely conveyed in terms of models of systems, this 
chapter will concern itself with a general introduction to the systems approach 
in relation to its relevance to measurement systems design and application. 
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appeanng to be so) mathematical models At the other extreme, such as m parts 
of the soaal sciences, m economics, in human geography, in environmental 
studies, and other disciplines of the soft-sciences (also called empirical sciences), 
the models so produced may be very vague in definition due to their complexity 
often even to the point of attempting to allow for the possible existence of as yet 
unexplamed systematic (not metaphysical) factors that produce a system, the 
whole of which is more than the sum of its parts (This is known as the gestalt 
concept ) 

Debate flows back and forth between both boundaries as each kmd of group 
attempts to make use of the concepts of the others, often without success This 
IS not the place to attempt to unify systems approaches— many experts have 
tned This chapter makes use of the wide range of such concepts because 
measurement problems span the whole field of human endeavour and some of 
the necessary understandmg of measurement systems is still in the poorly defined 
state 

Design and application of a measurement system begin with an appraisal of 
the measurement need extending design outward to finally produce equivalent 
signals at some convenient location It will be found m practice that situations 
in the hard sciences generally are relatively straightforward to realize con- 
ceptually because the parameters to be sensed are clear cut In contrast the 
situation for soft science situations can be such that it is not possible to decide 
what It IS that should be measured, nor bow to do it when it is identified 

This chapter provides an appreciation of a wide background of knowledge 
that IS generally helpful to measurement designers, helping them to better 
appreciate the generahties of situations met 

2 2 SIGNALS 

As will be discussed in more detail in Section 2 73, measurement mformation 
is conveyed m some suitable form via an energy or mass transfer link, the 
measurement information being conveyed by such a earner as a changing state 
or modulation of the carrier The changing medium used in this way is termed 
the signal 

Signals, therefore, can take the form of variations of some parameter of hght 
beams, of fluid pressure, of mechanical movements, and, the most used today, 
of electneal energy Figure 2 I is a useful chart of mformation carrying tech- 
niques 

Where the modulation conveys no useful or desired information it is termed 
a noise signal or merely noise Noise, when excessive, can mask the true signal 
providing false mformation The energy or mass medium carrymg the signal is 
embodied in the system instrumentation 

Once a measurement parameter has been transduced mto an equivalent signal 
the signal can be treated by any one of the numerous signal processmg methods 
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Figure 2.1 Common information-carrying methods. (Reprinted from Stein, P. (1969). J. Metals, 22, No. 10, 41. 
Courtesy of American Institute of Mining, Metallurgical and Petroleum Engineers (AIME)) 
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according to signal transmission concepts The prime requirement is that the 
one-to-one mapping made at the transducer is not altered by the transmission 
system before the signal arrives at its destination An exception to this rule is 
when the properties of the transmission system are deliberately used to com- 
pensate, in some way, for a known deficiency m the sensing stage or to extract 
only part of the conveyed information 


23 MODELLING 


23.1 Models Motor Ingenuity 

Models of various kinds are used extensively to provide representations of 
some aspect of the real-world system of interest They enable us to investigate 
the real situation without needing to actually produce it or modify an existing 
situation Models are developed for many reasons~to enable, for example, 
investigation of a given system’s behaviour, to enable a scenario to be con- 
sidered, to provide a more convenient medium of discussion of certain features. 
They provide, as was recently stated, means to motor ingenuity 
If a model were developed to represent exactly and completely a given system 
then clearly it is an exact replica In practice a model rarely is a complete 
representation, it only models chosen aspects of the system, this is further ex- 
panded in the following sections where various kinds of models are discussed 
Other published accounts on this topic are Finkelstein (1977) and Sydenham 
(1979) 


2.3.2 Linguistic Models 

The linguistic model uses the natural spoken language to express sufficient 
parameters, and their interactions, of the system of interest such that an adequate 
level of explanation, of mformation transfer, takes place In the empirical sciences 
and the arts this is the prime form for presenting models of situations and 
relationships 

Its major shortcoming is the lack of exactitude and speed of communication 
that can be reached within a given level of effort It serves as the first level 
of modelling, providing for reasonably efficient communication between 
persons, especially in the area of expression of subjective and emotive concepts. 
It can rarely serve the hard-sciences as well as it has been adopted and used in 
the soft-sciences It often plays a complementary role along with the pictorial 
methods, providing descriptions of parameters It is the semantic feature of many 
words, and groups of words, that makes resolution, of such matters as differences 
of interpretation of specifications, so often intractable 
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2.3.3 Iconic Models 

An icon (ikon) is an image, figure or pictorial representation of a concept. 
Iconography is the description of a subject by means of drawings or figures. 

In engineering and the sciences iconography is used extensively; indeed to 
such an extent that most communication of ideas uses it. The importance of 
pictorial representation in the history of technology has been studied (Ferguson, 
1968). 

Examples of iconography are the use of three-view engineering drawings of 
objects, circuit diagrams, block schematic diagrams, isometric projection 
drawings, and graphs of various kinds. Graphs are singled out as a class in 
Finkelstein (1977) because of the conceptual difference that they represent 
performance relationships rather than the more direct physical relationships of 
make-up. Frequency response plots, signal flow diagrams, flow charts, signal 
graphs to show system nodal interconnections, activity charts, and production 
resources evaluation charts are each examples of the graphical form of icon. 

The iconic level of modelling is usually reached very early in any design or 
discussion of a measurement system problem. Lines are drawn appropriately, 
their geometrical relationships and linguistic annotations conveying the aspect 
of information required. 

These considerably motivate the mind and certainly assist such matters as 
construction and assembly. However, they do not provide the same depth of 
objective rigor as do the next class of model to be considered— the mathematical 
model. 

2.3.4 Mathematical Models 

Mathematics is in itself a modelling discipline for it enables relationships 
between defined quantities to be expressed in terms of representative mathe- 
matical symbols and statements. 

Mathematical models can be extremely powerful design aids. In many areas 
this form of model expresses fundamentals of systems at a conceptual depth 
well beyond that which can be realized as practical systems. As an example it 
was by way of a mathematical model of electromagnetic fields that Clerk- 
Maxwell was able to show that, at the time, unobservable radiation can take 
place under the correct conditions. Mathematical models are also able to be 
run faster in time than the physical system they represent. They enable a degree 
of prediction. A subject can be considered to have reached a level of maturity 
when it can be modelled mathematically to an adequate level. Often the effort, 
which can be considerable, to produce the mathematical model is a worthy 
investment. It was the 19th century philosopher Boole’s work that produced 
Boolean algebra which subsequently enabled the ubiquitous physical digital 
computer to be developed according to a rigorous design basis. Section 2.5 deals 
with the generation procedure for these models. 
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23^ Phj'sical Models 

It IS often con\ement to construct physical models of a situation Examples of 
these are the scaled-down \ersions of a proposed process plant ora model of a 
de\ice for seeking patent appro\aL These clearly, again, only model selected 
features of the whole Gnen that any s>stem of interest has numerous aspects 
It can be seen that there could easil> be more effort and matenal construction 
expended on models than would exist in building the real sjstem itself Ex- 
penence is needed to decide which kmd of model is the most appropnate and 
which kmd will provide the greatest increase in understandmg for a given level 
of effort mpuL 

It IS to be expected that the more sophisticated the model the wider its 
apphcabilitj and usefulness 

Information, signal, and power flow models are distinguished as important 
to the understanding of measurement s>stems in Fmkelstem (1977) 


2.4 SYSTEM STRUCTURE 

As stated in Bosman (1978) s> stems can be reticulated or box-cut, a process m 
which the whole is, for purposes of analysis, broken down mto more detailed 
assemblies of subsystem blocks, the process being taken as far as is needed, 
perhaps finally to the basic elements So much has been written about systems, 
by so many persons, covenng such a wide range of applications, as to make the 
umversal unification of lenmnology perhaps impossible to achieve With 
respect to instrument systems, Bosman (1978) and Mesch (1976a, b) appear to 
be the only guidance statements made on the conjoint subject formed between 
system theory and measurement systems design In the mam, Bosman discusses 
it m terms of levels within extensile systems whilst Mesch deals with typical 
block diagram transfer function mterconnections and the system state Mathe- 
matical modelling of instruments in particular is covered m Fmkelstem and 
Watt (1978) and m Volume 2 of this handbook. 

In the summary to Bosman’s paper he reticulates an instrument system into 
subsystems and partial systems The former concerns ‘separately identifiable 
conglomerations of equipment (instruments) with connected (partial) functions 
w hich are logically geared to the system’s mam goal' He defines partial sy stems 
as a genenc part concerning one aspect Further partitioning arrangements 
suggested are toconsidernspects (examples are performance, operation, physical, 
economic) and structures (functional, organic, activity, information, soao- 
lechnical are suggested) 

The approach adopted by Mesch is somewhat complementary to that of 
Bosman as it provides another aspect of system reticulation Pnncipally it is 
based upon the fact that subsystem blackboxes can be expressed as mathematical 
transfer functions mterconnected to form the whole 
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(b) 



Figure 2.2 Example showing (a) given block diagram and 
(b) the reduced single-loop form 

Commonly met configurations of transfer functions are the serial or cascade 
chain, the parallel connection, and the single feedback loop. These are dealt with 
in Chapter 4 (Section 4.3 in particular) and in Section 14.3, where systems are 
also discussed. 

An algebra of block diagrams has been developed (Di Stefano et al, 1967); 
it is useful for handling more complex situations as it provides means of possible 
reduction (Figure 2.2) or as a means to isolate a block. Surprisingly, a small 
number of structural arrangements suffice for much of instrument design, this 
being because adequate design can be done using reduced models wherein only 
the dominant transfer functions need to be retained in the final model. 

A transfer function is a mathematical model of an input-output relationship. 
The physical form of the hardware that provides that relationship is not em- 
bodied specifically in a transfer function. It is, therefore, often possible in 
practice to produce modelled realizations in more than one energy domain. For 
example the transfer function for a second-order system, can describe, in the 
mechanical domain, the time-velocity behaviour of a mass and a spring, in the 
electrical domain the behaviour of current in an inductance-capacitance circuit 
plus other arrangements such as behaviour in a fluid medium. This universality 
has become known as theanalogies. A general understanding of this commonality 
in physical systems is to be obtained from Shearer et al. (1967). As this is discussed 
at some depth in Section 4.3.2.2, it needs no further expansion here. 

2.5 DEVELOPMENT OF MATHEMATICAL MODELS 

Mathematical models tend to be those one strives for ultimately because of the 
in-depth insight they bring to a subject. It is, therefore, of value to describe 

riefly the steps taken in treatment of a topic where this form of model is sought. 
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The first step is to consider the subject to be modelled from any and every 
aspect that appears relevant Considerable experience and skill assists this vague 
initial step along As features, aspects, relationships and the like, are realized 
they are recorded as some kind of model, generally as symbolic black boxes in 
which will be at least a linguistic form of description For example, a temperature 
sensor would be a black box having temperature labelled as the input and some 
form of equivalent signal output The box could be labelled ‘temperature 
sensor 

A second step (in reality any step of the process that might be advanced as 
progress is enabled by acquisition of new knowledge) is to sort the general 
information into systematic classes aiming to identify the various black boxes 
and their interconnections Figure 2 3a illustrates this first step which leads to a 
schematic block diagram m which links are formed and labels given to individual 
blocks 

This iconograph is then continually refined and broken down until it is 
possible to see that individual blocks are at such a level of simplicity (Figure 
2 3b) that they can be assigned appropriate general mathematical input output 
relationships (see Figure 2 3c) Linear equations are preferred but non linear 
expressions can also be handled For example, the second*order, spring-mass 
system mentioned earlier would be represented as a box in which the transfer 
function IS that of the second>order linear differential equation 
In many cases the arrangement of boxes and the equations selected as the 
model may only provide the overall transfer function sought with internal 
operation being quite different at theiK>des(orpom)ofthe system It is important 
to make the distinction between modelling a system at only a total transfer level 
and as one in which internal operation is also to be modelled 
A model, thus made, is relevant to a very wide range of physical manifestations 
for the numerical coefficients of the various equations selected are not yet 
identified To bound the general model into an adequately specific one, it is 
necessary to convert all such parameters into the numerical state These param- 
eters are, m fact measurement variables that either have already been evaluated 
(such as the mass of an electron, the value of gravitational attraction or a con- 
version coefficient of a sensor) or need to be measured m order to bound the 
model 

Thus is created a mathematical model of the system of interest However, to 
stop here is hazardous for models must be proven, evaluated in some way to 
ensure that they do indeed represent the reality desired 

One testing method, of course, is to build the model in its real-world physical 
form and compare its performance to excitation inputs with the original system 
This IS often not practicable because of such factors as the cost is not warranted, 
the system cannot yet be built, or the model must run at a different time rate 
The concept of the analogies provides a means of testing for it enables 
analogous systems to be constructed on analog or digital simulators 
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Recorder-cont roller 
set with desired 
temperature as reference 


Actual temperature 
measured 


desired temperature 


(b) 




Measuring 

element 


Figure 2.3 Stages in development of mathematical model of an 
example temperature control system: (a) physical layout in block 
form with descriptions added; (b) system broken down into re- 
cognisable blocks; (c) mathematical equations added with co- 
efficients ready to be identified. (Adapted from Coughanowr and 
Koppel (1965)) 


Usually the first prepared models are proven to be inadequate in some respect 
and the process of refinement continues until one runs out of ideas, effort or, 
more optimistically, until the model provides an adequate level of simulation. 

At this point, provided the limitations of the model are well known, the model 
can then be used as the basis for further, more efficient, study of the original 
system. To cite an example, mathematical modelling of an inductive sensor 
enabled the modeller to investigate frequency of operation, interconnections, 
and effect of physical geometry on the sensitivity without the need to actually 
build a single device. 
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The process of mathematical model building is explained in Coughanowr and 
Koppel (1965) m relation to general process control systems In a specifically 
instrument sense see Fmkelstein and Watts (1978) and Volume 2 The aspects 
of fitting equations and sizing coefficients is a major part of systems identification 
and parameter tsimaiion—TtkT to Chapter 8 for a survey of the latter 

Creating a mathematical model rests very much upon adequate a prion 
knowledge being available for the model is build from this In many disciplines, 
especially those of the empjncal sciences, although it is recognized that more 
knowledge is needed and m what area, the practical reality can be that the data 
needed cannot be obtained 

Many authors have wnlien on methods of modelling badly defined systems 
but in general it must be said that considerable effort has been expended on such 
problems with less than desired returns A review of this situation is given m 
Young (1978) There, the author also presents his generalized approach to the 
modelling of badly defined systems from the holistic viewpoint, one in which 
one attempts to produce a model from knowledge of the intact system The 
alternative approach, often adopted is that according to the reductionist 
philosophy wherein the real system is philosophically broken down into sub- 
systems which are explainable in mathematical terms The catch ts that the a 
prion knowledge is often insufficient, thereby allowing a breakdown to be made 
as the observer believes it should be rather than what it might actually be 
Young presents examples of his suggested systematic procedure applied to 
several badly defined systems A guide to development of models m general is 
also given 

The process of creating mathematical models is often aided by the use of 
graph theory In this approach the simultaneous equations of the system are 
represented by iconographs formed from lines that graph signal flow Figure 2 4 
is an example An introduction to signal flow graphs is given in Di Stefano et al 



(b) 



Figure 2.4 Graph theory applied lo a 
feedback loop (a) block diagram, (b) signal 
flow graph 
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(1967) where the various rules of the algebra, definitions, and construction are 
dealt with in a tutorial manner. Many texts exist on graph theory, of which 
Mayeda (1972) and Henley and Williams (1973) are written from the engineering 
viewpoint; the former also contains an extensive bibliography. Graph theory 
found earlier application in the management and operations control of engi- 
neering enterprise. It began to be applied to physical systems modelling in the 
mid-1960’s, a time at which a branch, that of bond graphs, emerged to handle 
multidisciplinary system elements. Details of this approach are to be found in 
Karnopp and Rosenberg (1968), Karnopp (1974), and Karnopp and Rosenberg 
(1975). 


2.6 THE PLACE OF MEASUREMENTS IN SYSTEMS 

A key purpose of a measurement is to map a physical parameter into (usually) 
an equivalent number set. In many instances that is all that is needed, but there 
also exist many situations where the measurement signal is used to continually 
control a process. It is rarely appropriate to view a measuring instrument as a 
stand-alone artefact. It is important to know its use in relation to the specific 
extensive system of which it forms a part. Figure 2.5 depicts one author’s model 
of the entire process (Sydenham, 1979). 

A measuring system comprises a sensing stage, in which the original parameter 
to be measured, called the measurand, is transduced into an appropriate equiv- 
alent signal. The sensor’s role is to extract specific information, to act as an 
information filter, passing information on the state of a particular chosen 
parameter existing within a possibly infinite set of definable parameters that 
totally describe the system. 

A measuring system, as well as being able to convey internal messages— the 
signal— without loss of accuracy must also, overall, map variables in a faithful 
manner. This line of thought is expanded in Sydenham (1979). 

Measurements of actual systems— and no other kind exists— provide 
numerical values for the state variables, thus providing specific bounds to 
generalized situations. Any mathematical modelling exercise must eventually 
require measured data when it needs to be applied. In many cases generalized 
mathematical models need some coefficients to be limited numerically in order 
to allow reduction of an otherwise intractable general solution. 

2.7 MODELS OF THE MEASURING INTERFACE 
2.7.1 The Set Theory Model 

The previous chapter provides a statement on current considerations aiming to 
find a basic rigorous mathematical model for what occurs at the interface 
formed between a sensor and the system to which it is connected. 
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Physical slate set MeosuremenI Representative symbol 



Figure 2.6 Set-theory model of a measure- 
ment (From Finkelstein, L. (1975). Measure- 
ment and Control, 8, 105-11. Reproduced by 
permission of Institute of Measurement and 
Control) 

That model is based upon set theoretical considerations. The iconographic 
representation of the mapping process is given in Figure 2.6, after Finkelstein 
(1975). This model of a measurement appears to be the most fundamental and 
being mathematical, it should ultimately pave the way to machine decision- 
making about the design of a measurement interface. At present, however, it is 
doubtful if many practising measurement systems designers would find it of 
great use as an aid to design. 

From it we can recognize certain of the features demanded of a measurement 
system. The need for equivalence between the real scale and the mapped one is 
evident as is also the need for a mapping process for each stage parameter to be 
measured in the physical system. 

Although they may eventually be seen as no more than learning aids, two 
other models are available that also stimulate thought about design. 

2.7.2 The Popular Definition Model 

Most texts on measurement explain what a measurement is in terms of compar- 
ing the unknown quantity against a defined standard for that kind of quantity 
which has embodied in it some form of subdivisional scale. Figure 2.7 shows how 
this can be represented in terms of measuring the length parameter of an object. 


1 metre 


Standard 
of unit 


0. -1 2 -3 4 5-67 .8 -9 1:0 

_l I I I I I I I I I I 


Unit IS the 
metre 


Unknown 

(measurand) 


0 74 m long 


Subdivision of unit 
forms the scale 


Unknown IS meosured by comporison with defined stondord using a 
scale to subdivide the unit. 


figure 2.7 Popular definition representation of a measurement (From 
Sydenham, P. H. (1979). Measuring Instruments: Tools of Knowledge and 
Control. Reproduced by permission of Peter Peregrinus Ltd.) 
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Like ihe set-theoretical model, discussed above, it also indicates the need for 
an agreed standard unit for the parameter and a way to realize it It also makes 
apparent the need for a method by which the standard and the unknown are 
compared Also revealed is the problem of what to do about any differences 
between them that lie outside exact integer equality 
Many measurements are not easy to conceptualize in such terms, for example 
the thermodynamic gas scale for temperature, a kind of variable that does not 
algebraically add to produce a larger value, and the even more poorly defined 
concepts such as conflict, love, and pain 


2.7.3 The Information Selection Process Model 

The term information has two different usages In common language it relates to 
a collection of facts, ideas, entities, concepts, and attributes that define a subject 
or object— for instance an encyclopaedia is ‘full of information’, it contains a 
host of meaningful statements 

In the information theory sense used by communication engineers it is con- 
cerned with the quantity conveyed in a message passing through a communica- 
tion channel In this sense the conceptual idea of meaning of that message is not 
catered for by the theory An apparently nonsensical message (for example, a 
coded message) can be transmit!^ with utmost fidelity 
Thus a measuring system is concerned with both kinds of information It 
must map the variable (that is, codify the measurand) and also transmit it 
according to information theory 

Philosophical thought m this direction is scant, especially by measurement 
systems designers One writer (Stem. 1970), suggests that an object possesses 
latent information, that is ready-to-be-tapped information, that the sensor 
selectively filters out into the measurement channel Based upon the information 
concept of Stem it is possible to generate an informational aspect model of the 
measurement interface Referring back to Figure 2 5, it can be seen the sensor 
connects to the system in order to provide an energy or mass transport link. 
We know from experience that measurement data flow on such carriers but we 
have little idea of the quantitative relationship between them (Sydenham, 1979) 
Nevertheless this model shows clearly that the act of measurement must bring 
about an imbalance of original energy or mass balance conditions 
It IS, therefore, necessary to create sucha link m order to obtain a measurement 
and the imbalance must be either negligibly small or well known so that it can 
be compensated One measurement method adopted is to adjust the energy 
flow until It ceases, this is called the null-balance method The conditions of 
balance then constitute the measurement at that point 
The information model also shows how stray noise energy can be so 
disastrously influential on good measurements 
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2.8 KINDS OF MEASUREMENT SITUATION 

2.8.1 Interaction Between the System and the Measuring Stage 

The measuring interface comprises the system of interest and some kind of 
measurement sensor that codes meaning to what is sensed. 

Four classes of this can be identified as an aid to understanding what might 
take place. The basis of the class into which to place a specific situation is based 
on the rigidity of the code associated with the meaning allocated. 

Class 1. This is the case when the designer has created a hardware sensor and 
applied it to produce data to which a clearly defined meaning has been stated. 
An example might be the act of placing an electric thermometer into a water 
bath, recording the results as indicating changes in bath temperature. 

In essence someone has coded the data to mean the temperature unit with 
some kind of scale. The code is rigidly applied; it is thereafter assumed, until 
recalibration, that the code remains unchanged. Clearly there always exists a 
constant danger that malfunction or an additional noise source could occur 
altering the code. For this reason many measurement systems will have a 
periodic self-checking feature. 

It is important to realize in this class that whatever happens to the measure- 
ment link it does not, once set up, materially alter the system, this being the 
key feature of the classes that follow. 

Class 2. When a scientist or engineer wishes to learn about a process a sensor 
is applied having, at the onset, an ascribed mapping code. 

The process is unaltered by the application of a properly designed sensor but 
here it is the observer who changes as observation continues. In many cases the 
knowledge obtained by the act of measurement alters the observer’s concept of 
what is being measured. The meaning, which is ascribed by the user in the first 
place, can unwittingly be changed as measurement proceeds. This situation 
continually arises in the use of measurements to seek new knowledge. 

Class 3. When two or more observers communicate using their natural senses 
and data processing ability the situation of class 2 extends to both systems 
possibly changing as observation of each other continues. 

This totally interactive state is regularly met in human communications and 
is just being met between humans and the semi-intelligent computing devices. 
It is to be expected that the occurrence of this kind of problem will increase as 
time passes and more intelligent machines are produced. 
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Class 4 The logical extension of class 3 is to the state where two man created- 
hardware intelligent systems sense each other and act accordingly This already 
arises in the interfacing of computational networks 


2.8.2 Access to System Measurement Nodes 

This section deals with another kind of measurement situation that arises, that 
of access to measurement data points of a system 
Se\eral situations will be met in practice when measurement is desired 

Accessible real parameters Of the potentially infinite parameters that might 
need to be measured only certain of them will, at any given time and state-of- 
the-art, be accessible A part of the practice of the scientific method is very much 
a pursuit of developing means to make parameters accessible 

Inaccessible real parameters Although it can be reasoned or, by indirect 
method, experimentally shown that real parameters exist that would be worth 
measuring, m some cases it is not possible to do so for practical reasons As an 
example measurement of many of the human physiology parameters is not yet 
possible under the normal operating state conditions because measurement 
methods would cause permanent damage 

Inaccessible unreal parameters Models can as has been said earlier, generate 
state parameters that have no real physical relationships to the system modelled 
Thus any amount of direct measurement effort will not lead to measurements 
being made The data sought, however, might be obtained by indirect methods 
As a simple example the physical real electrical circuit inductor is often modelled 
as a pure inductor in series with a pure resistor It is not possible to measure the 
voltage at the central node directly The values can however, be deduced by 
properly constructed separate inductance and resistance tests or by use of a 
dual control alternating current bridge 
In each of the above cases it is tacidly assumed that suitable parameters could 
be defined In many situations defining a unique singly variable parameter is 
the key problem Often a many-to one mapping is attempted to overcome this 
Chapter 7 discusses this in terms of the generalized topic of pattern recognition 
Another difficulty met might be that although a suitable parameter has been 
identified and made accessible by use ofthe appropriate sensor, the all-important 
standard of the unit is not held constant Examples of this arise in the discipline 
of econometrics, the study of measurement in economics Analysis methods have 
been developed there that assume that the standard is changing in a definable 
statistical manner In the physical science situation there is a tendency to ignore 
the likelihood of instability of the standard once it is declared 
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Chapter 

3 P. H. SYDENHAM 

Standardization of Measurement 
Fundamentals and Practices 


Editorial introduction 

The preceding chapters began this handbook by laying down an understanding of the 
measurement process and its relevance to practical systems of both physical and empirical 
science nature Measurement of the physical parameters has, in general, been accomplished 
more easily than in the empirical sciences and for this reason has matured at a faster rate 
in Its procedural and terminological areas 

This chapter is concerned with establishing a proper understanding of standard pro- 
cedures, practices and concepts in the areas of nomenclature, physical standards, and 
standards of specification It rounds out with a section describing the relevant institutional 
activities that c\ist to generally aid the practice of measurement Many of the concepts, 
although not >ct clearly applied to measurement practice m the empirical sciences, are 
ncscrthclcss largely relevant to those more complex measurement situations 

The material presented in this chapter is of key importance to good measurement 
practice It is an area reasonably well developed in standardizing institutions but one that 
IS yet to be taken up at an adequate level by the majority of measurement scientists and 
technologists Use of common linguistic terms and set procedures would do much to 
improse interchange of knowledge 

3.1 INTRODUCTION 

The basic concepts explained in the previous chapter are applied in many 
dilTcrent disciplines, in many different ways. Putting them to general practical 
use preferably requires the establishment of agreed procedures which are 
controlled by the use of agreed primary physical standards of the various defined 
units This, in turn, creates a need for standardized nomenclature to describe 
the concepts concerned. 

As well as standards for the physical units there exists another kind of standard 
that relates to practices and specifications. These standards also possess agreed 
nomenclature 
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Although measurement science has not been seen in the past as a clear-cut 
discipline existing in its own right there has nevertheless, emerged over the last 
century a considerable amount of standardization of its terminology and of the 
physical and specification types of standard This chapter provides an introduc- 
tion to these aspects and to the organizations, procedures, and systems of 
administration used to help ensure that these standards progress toward 
becoming common methodologies As will be shown, continual progress is 
necessary This introduction will help make it clear as to how to proceed with 
such matters and how to make allowances for the variations that exist and will, 
no doubt, continue to exist 

The subject matter that needs comment divides appropriately into groups on 
nomenclature (Section 3 2), classification of measurement science knowledge 
(Section 3 3), physical standards representing the agreed units (Section 3 4), 
standard specifications (Section 3 5), how standards are put to effective use 
(Section 3 6), and finally (Section 3 7) the institutions that are operating m this 
area Material given m other chapters, where more specific mention occurs, is 
also relevant 


3,2 NOMENCLATURE OF MEASUREMENT 


3,2.1 Standard Nomenclature of Measurement Science 

The terminology of a discipline develops as the discipline matures Eventually 
a stage is reached where the practitioners decide it is time that the terms that 
they use to describe, in a linguistic manner, the concepts that they use frequently 
should be standardized to overcome the confusion that has prevailed If the 
discipline is followed by only a small number of dominant practitioners it is 
reasonably easy to bring about uniformity of nomenclature Measurement, 
however, is practised in all branches of science and industry and has been 
developing with little coordination as a described and recorded methodology 
for over two centuries As might, therefore, be expected, the terminology of 
measurement m use is far from fitting a sole common standard set of terms 
The problem is complicated by the universality of the use of the basic concepts 
in so many different disciplines, m many languages, and because there are so 
many terms involved No single and absolutely encompassing terminology has 
yet evolved However, the past few decades have seen the emergence of several 
dominant sets of nomenclature that cover the general terms and definitions 
involved m measurement It is not possible, nor sensible (because of the multi- 
plicity of standards existing), in the space permitted here to describe all terms 
defined Furthermore, they are in a state of constant flux This section will 
introduce the mam terms met and used, descnbing where the various issued 
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standards may be located. It is important to use the standard terminology 
defined for the task in hand. 

Firstly, measurement science is also often referred to as metrology this being 
the field of knowledge concerned with measurement. This statement immedi- 
ately raises another problem that exists; the use of numerous synonyms to 
describe the same, approximately equal, concepts. 

Standard terminology has been issued at all levels ranging from international 
down to within a single commercial organization or within a particular author’s 
text. 

The Organisation Internationale de Metrologie Legale (OIML) issued a 
document entitled ‘Vocabulary of legal metrology, fundamental terms’ in 1969. 
The official definitions are those expressed there in the French language. The 
British Standards Institution issued a dual language version in 1971 as PD 6461. 
The English language listings are not official but must suffice for a very large 
part of the world. This document gives an extensive list of terms that arise in the 
course of describing the methodology of metrology. Table 3.1 lists several 
relevant documents that give general metrological terminology. To a large 
extent many of the individual countries’ standard terminologies are based upon 
the OIML document mentioned above but there will be small differences. 


Table 3.1 Selected standards documents relating to nomenclature of general metrology. 
(See Tables 3.4 and 3.5 for explanation of the code letters; many are endorsed as other 

national standards) 


Document reference 


Title 


(OIML)PD 6461: 1969 

AS 1514. Part 1: 1980 

BS 5233: 1975 
BS 2643: 1955 

lEC 50(00): 1975 

lEC 50(05): 1956 
lEC 50(07): 1957 
lEC 50(12): 1955 
lEC 50(20): 1958 
ISO R3 1 series 


AS 1384: 1973 
AS 1633: 1974 
AS 1057: 1971 
AS 1929: 1977 


Vocabulary of legal metrology— fundamental terms (dual 
language version from British Standards Institution) 
Glossary of terms used in metrology— Part 1. Genera! terms 
and definitions 

Glossary of terms used in metrology 

Glossary of terms relating to performance of measuring 

instruments (endorsed as AS Z23) 

General index of the international electrotechnical 
vocabulary 

Fundamental definitions 
Electronics 
Transduct ors 

Scientific and industrial measuring instruments 

On base quantities, mechanics, heat, electricity, light, 

acoustics, physical and inorganic chemistry, atomic and 

nuclear, nuclear and ionizing radiations 

Transducers for electrical measurements 

Glossary of acoustic terms 

Glossary of terms used in quality control 

Glossary of terms used in non-destructive testing 
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Furthermore the OIML document does not adequately cater for modem auto- 
metrology practice (At the time of editing AS 1514 Part 1 is the most recent 
published revision of the terminology of metrology practice revision of the 
international level of standard is currently underway ) 

Numerous additional standard specifications exist that state the terms to be 
used in the various specialist areas of metrology In all, a considerable number 
of defined terms is available and it is the duty of the person or group involved 
to be familiar with those terms that pertain to areas of direct interest to them 
How ev er, many people who measure do not make use of these nomenclatures, 
because they do not work within any definite field, and therefore possibly lack 
guidance about which standards to use, and because they are not required to 
use them As an example, many journals that report measurement technique 
do not specify a nomenclature to adopt beyond specifying the use of SI metric 
units 

It is not reasonable to give an encyclopaedic glossary of the terms involved 
and the reader is directed to consult the relevant standard There are, however 
several terms that predominate m common usage and that appear throughout 
this handbook They are given here in the form stated in the BSI English 
translation of the OIML terminology (PD 6461) 

Tenns can be grouped into those concerning the characteristic features of the 
measurement process and of the instrument used, the kinds of errors arising, 
and the methods of measurement These are now discussed m turn 

3,2 2 Nomenclature of a Measurement and of Measurement Performance of 
an Instrument 

Three distinctly different concepts arise when conducting a measurement or 
descnbing the performance of a measuring instrument They are often used 
incorrectly as synonymous terms 

The first relates to the resolving ability of a measurement process This is 
commonly called the cesobif.vaa b'H. ite. PD does tvjl, vo. fact., 

include this word, discrimination being given as that word to be used to describe 
‘the quality which characterises the ability of the measuring instrument to react 
to small changes of the quantity measured’ The quantity to be measured is 
usually called the measurand in English speaking countries, but this term also 
does not appear m the PD 6461 document 

Ability to discnmmate can be enhanced by increasing the gam of the sensing 
stage For example, a magnifying glass can be used to sight the position of the 
index mark against a ruled scale with greater resolution than by the unaided 
eye, an electronic amplifier can be used to enlarge an electrical metrological 
signal Too often this figure is quoted alone, users wrongly assuming that it 
related to repeatability and accuracy Having adequate discnminalion is a 
necessary but not sufficient condition for a satisfactory measuring instrument 
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Discrimination should be set so that the values obtained for the steady-state 
measurement value vary a little, thereby indicating that the apparatus is 
responding. Too great a discrimination level will provide excessive fluctuation 
of the signal requiring excess capacity data display or collection. A well designed 
instrument has its discrimination tailored to suit the task. It is usually relatively 
easy to obtain the discrimination level needed, gain no longer being a problem 
in instrument design. 

A very common error made is to quote the discrimination value in a way that 
suggests the instrument possesses that degree of repeatability and even accuracy, 
the next two terms to be defined here. The three terms, discrimination, repeat- 
ability, and accuracy, are quite different descriptions. They are not synonymous. 

The ‘quality which characterizes the ability of a measuring instrument to give 
the same value of the quantity measured, not taking into consideration the 
systematic errors associated with variations of the indications’ is defined in the 
OIML document as the repeatability. The term precision is very widely used to 
describe this attribute but, again, the OIML standard does not mention the 
term. 

It is important to distinguish between the closeness of values that define the 
repeatability of the individual, or the group of values, when measured in the 
short term with the same apparatus (as defined here) and the same parameter 
determined by a long-term set of measurements or by different persons with 
different apparatus. This latter case is called the reproducibility. 

A considerable part of published accounts on methods of handling data to 
decide such parameters as the repeatability are based implicitly upon the premise 
that the same measurement is made several times providing a set of data that 
can be operated upon in some statistical way. It is, however, often the case that 
only one measurement can be made. Methods exist that will enable the observer 
to estimate the repeatability, or reproducibility in this instance. 

A third situation needing clarification also arises when the instrument, the 
observer’s performance, and the external perturbing parameters (called the 
influence quantities) are each constant, variation in the values being caused by 
changes of the measurand itself. This is the repeatability (or reproducibility) of 
the measurement rather than of the instrument which was discussed above. In 
practice the two sources of variation occur simultaneously making analysis or 
reduction of influence effects complex. 

Instruments with adequate discrimination and repeatability can often be 
entirely useful instruments. However, there are circumstances where the third 
attribute, accuracy, becomes necessary. Accuracy of an instrument is ‘the quality 
which characterizes the ability of a measuring instrument to give indications 
approximating to the true value of the quantity measured’. It is an expression 
of the truthfulness available: lack of accuracy arises from both the instrument 
and the imperfect standard of the unit. A bent pointer on an indicating voltmeter 
will give the same level of discrimination and repeatability as one not having a 
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bent pointer but the reading observed may be in error from the true defined 
value for the volt It is well to stress here that no instrument yet made is absolutely 
accurate at indicating the value of a measurand It is technologically impossible 
to obtain the absolute case An instrument that is defined as providing the 
primary standard of a unit can be such but the influencing quantities and the 
intercompanson process that must be used to make a determination of a 
measurand with that apparatus will always introduce error A philosophy 
should be adopted whereby instruments are regarded as ‘not good enough until 
proven otherwise’ Measurement is a science of understanding errors as much 
as measurements 

The accuracy of an instrument depends upon knowledge of how indicated 
values relate to the agreed standard The process which ties accuracies from the 
used instrument to the primary standard is called traceability and is covered 
below m Section 3 6 2 Accuracy is finally assigned to an instrument by agree 
ment it does not automatically arise from good design alone The term accuracy 
is often confused with another, hneani), which expresses how values lie on a 
linear, proportional scale The scale might be very linear but biased in slope or 
offset from the assigned value and, therefore, not be accurate 
Accuracy becomes important when many different user groups make use of 
the same declared unit for expressing and applying their results They will not 
be working m a consistent manner unless each has his equipment set to read the 
same value of output as other users’ equipment for the same input as their 
equipment Consider, for example, the manufacture of parts for a motor car 
that come from different counines and must fit together This implies that mea- 
suring instruments must generally be routinely compared against a common 
standard of the unit, a process called calibration 
In order of ease of attainment discnmination is usually the most easily 
procured feature m an instrument, repeatability comes next with accuracy, very 
much harder and more costly to obtain, in a third place 
Other terms that describe the charactenstics of instruments include transfer 
function, sensitivity, amplification, gam, hysteresis, magnification factor, 
damping, response time, dnfl, and stability many of these are not defined in the 
PD 6461 document but can be found described in other documents given in 
Table 3 1 

3,23 Nomenclature Describing Errors 

The next group of terms m common usage is that group concerned with de- 
scription of the errors arising in the measurement and m the mstrument 
Measurements are never perfect, errors occur as deviations from the perfect 
case They arise from numerous sources but can be placed into one of two dora- 
mant general groups systematic or random errors 
Errors result in the measurement determination having associated with it a 



STANDARDIZATION OF MEASUREMENT FUNDAMENTALS AND PRACTICES 


55 


certain level of uncertainty about the value obtained. The range within which 
the true value lies is termed the uncertainty. In many cases the limits of range 
must be estimated; statistical probabilities can be stated where the case allows 
it, which is not always so. 

Past practice has tended to quote a measurement value followed by the 
numerical value of the error band, calling this latter statement the error of the 
measurement. Correctly it is the uncertainty of the measurement; the true error 
may actually be less but the observer can only be certain that it lies within a given 
range. 

Systematic errors are those that can be predicted, from past knowledge of 
the operation involved, on an individual measurement basis. For example the 
voltmeter mentioned earlier may have a bent pointer but calibration can deter- 
mine the amount of bending allowing a correction to be made to all values 
subsequently taken with it. The cause of a systematic error may be known but 
may not have been eliminated for reasons of expediency or impossibility. Such 
errors can also come from an unknown source but may have been established, 
as a predictable function of certain known related variables in an empirical 
manner. An example of the latter case is the use of curve fitting to an experi- 
mental determination of the phenomenon which allows a correction to be 
calculated for future readings. This curve is termed the calibration chart. 

Random errors are those which cannot be predicted on an individual basis 
but for which a statistical method can yield information about the mean value 
of a set of data using the theoretical laws of probability. An example is the level 
of white electrical noise at any required instant of time. Being truly random in 
nature it is only feasible to predict the mean of a run of signals or set of values, or 
to stale a probability of the next instant of signal having a certain given value. 
The nature of random signals is often assumed to be Gaussian but this is not 
always the case. If it is not so then there is a strong chance that most statistical 
methods used are not entirely applicable (Feinstein, 1971). The handling and 
manipulation of errors is the subject of Chapter 6. 

The two groups of error, systematic and random, are formed from many 
subclasses of error. The various terminologies used to distinguish errors include 
error of measurement, systematic error, random error, parasitic error, error of 
method, observer error, parallax error, interpolation error, error of indication, 
repeatability error, rounding error, discrimination error, hysteresis error, 
response error, datum error, zero error, intrinsic error, influence error, tem- 
perature error, supplementary error, total error and so on (refer to AS 1514 
Part 1, 1980). 

3.2.4 Nomenclature Describing Measurement Methodology 

Although all measurements are made by comparing the measurand against a 
defined standard in some way, there exist many ways to achieve this end. The 
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methodology of makmg measurements ako has a defined terminology that 
enables efficient communication of concepts by the use of generally accepted 
terms 

A direct method is that method ‘by which the value of a quantity to be mea 
sured is obtained directly, without the necessity for supplementary calculations 
based upon a functional relation between the quantity to be measured and other 
quantities actually measured Some confusion may arise between this definition 
and the direct comparison method m which the desired quantity is obtained 
at Its full value by comparison with a quantity of the same unit For example, 
reading the length of a piece of metal stock by reference, using direct companson, 
against a graduated rule The required measurement is m the same units as that 
of the scale used 

An indirect method of measurement is that in which the parameter sought is 
gained by use of intermediate stages of different units which are linked m some 
positive manner As an example the method of measuring distance using the 
transit time of a pulse of radiation is indirect because the distance is calculated 
from the relationship linking the speed of light with the time of flight which is the 
actual measurand observed When the measurement made is based upon the 
base quantities (those agreed physical quantities not related to any other m the 
SI system of units) used to define the quantity the measurement is said to be 
using a fundamental method 

The comparison method of measurement is that ‘method of measurement 
based upon the comparison of the value of a quantity to be measured with a 
known value of the same quantity, or with a known value of another quantity 
which IS a function of the quantity to be measured’ Into this class of method is 
placed the direct comparison method already mentioned Another subset of this 
class IS the substitution method in which the measurand is replaced by a suitable 
quantity which is adjusted m value to brmg the indicator back to the value 
indicated mitially by the measurand The transposition method is also a direct 
companson method, one in which ‘the value of the quantity to be measured is 
initially balanced by a first known value A of the same quantity next the value 
of the quantity measured is put in the place of this known value and is balanced 
again by another known value B If the balance indicating device reads the same 
in both cases the value of the quantity measured is ^JaB' 

In the differential method comparison yields a slight difference between the 
standard and the measurand that is used to apportion a small additive value to 
the standard value Yet another comparison method relies on coincidence 
occurring between the standard and the measurand such as is used in setting 
the rate of a clock by observing the comadence of the clock cycles with a standard 
source of cycles The null method is somewhat similar to, but not identical with, 
the differential method In using the null procedure the measurand and the 
standard are adjusted until the difference between them is zero In this case the 
null indicator is not calibrated for the purposes of giving a small value to be 
added to the standard, it is there only for seeking the null condition 
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Two more comparison methods are the complementary method and the 
resonance method. In the former the value of the quantity to be measured is 
combined with a known value of the same quantity so adjusted that the sum of 
these values is equal to a predetermined comparison value. The other method 
is a comparison procedure in which a known relationship between the compared 
values of the same quantity is established by means of the attainment of a 
condition of resonance. 

Terminology used in describing aspects of the physical standards of units is 
covered in Section 3.4. Many terms used in the presentation of description of the 
static and dynamic regimes of an instrument or other system are introduced in 
chapters of Volume 2 pertaining to these aspects. 

Many works on measurement include glossaries ranging from the short to 
the long. These often include the basic metrology terms in addition to other 
terms in popular use. These, in general, are not tied to any particular agreed 
standard of terminology and their use is, therefore, probably less effective than 
their compilers intended. They are, however, of value in obtaining under- 
standing but they should preferably not be used as sources of published terms, 
the official standards being safer to rely upon. It is important to prevent pro- 
liferation of non-agreed terminology but this is easier said than done. Texts 
with glossaries include Beckwith and Buck (1969), Bell and Howell (1974), 
Foxboro-Yoxall (1972), Herceg (1972), NS Corp (1977), O'Higgins (1966), 
Stata (1969), and Sydenham (1974). Clason (1977) provides dictionary style 
definitions in eight languages. Dietrich (1973) defines the use of the uncertainty 
concept of quoting error; many other works are in print on the statistical 
manipulation of data for which many references can be obtained from the 
bibliography published by the Higher Education Committee of the Inter- 
national Measurement Confederation (IMEKO, 1980). 


3.3 CLASSIFICATION OF MEASUREMENT SCIENCE 
KNOWLEDGE AND PRACTICE 

Although a chapter of Volume 2 deals with the topic in more depth it is necessary 
at this stage to mention briefly the problem that must be faced in retrieving any 
aspect of the knowledge pertaining to measurement science. 

Information about the terminology used is related, more often than not, to 
the subject to which it applies. The knowledge of measurement science is 
scattered over the many areas of its application; only a little is to be found in a 
few specialist groups devoted to its fundamentals. It is, therefore, usually 
necessary to search for terminology in the various similar applications as well 
as the class in which general terms are covered. For example there exist many 
Standards about nomenclature in such fields as, say, acoustic noise, temperature 
measurement, and flow metering. The nomenclature of these overlaps con- 
si erably in concepts but often uses different words for the same concepts. Thus 
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there is often a large and confusing array of terms issued m several standards 
on the same topic 

If generally agreed classifications had been decided before the current 
confusion about the material of measurement science had developed— in the 
1900's— then, in all probability, it too would now be reasonably well compart- 
mentalized mto clear-cut divisions such as are found for the literature about the 
discipline of physics, for mstance The internal taxonomy of knowledge and its 
termmology for measurement science are only just beginning to become 
organized, this handbook being one of the first attempts to order the fundamental 
issues involved rather than stating their practical outcome at the current state 
of the art 

Information is stored in one or more of several genenc groups Consider the 
thermoelectnc junction It might have its information classified under the 
pnncipfe used, for example under ihermoelecincity It could also be placed with 
other simitar applications of the instrument or method, such as the thermo 
couple s use for temperature control Alternatively it may be found placed in a 
group representing uses of the same technological deiice, like a group on the uses 
of the thermocouple m measurements of many kinds Yet another group where 
information may be available could be under a classification on the basis of the 
energ) regime used , as an example the thermocouple would be found descnbed 
in a Vi ork on elecincity or on us subclassificauons Other possible classification 
locations that would need searching to estabhsh the relevant terminology and 
other information would be under systems concepts, because of the general 
mathematics concerned, industrial practice, because of the use of thermocouples 
m process plant, instrument design, and so on The number of possible places 
where one would certainly find defcitive information is great 

It should now be clear, from the above example of the thermocouple, that 
standardization of termmology and bterature storage for measurement science 
topics are not hkely to succumb to any simplistic, singular, standards The 
problem there is far greater than has been fac^ m bnngmg about a coherent 
system of primary and denved standards for the physical quantities 

information, given m Volume 2, will assist the organization of an effiaent 
literature search of hbrary stock and computer data bases 


3A UNITS OF PH\SICAL QUANTITIES AND THEIR DEFINING 
STANDARDS APPARATUS 

The fundamental concept of what a measurement is has been developed m 
Chapter 1 The common concept, that most close to the actual working face of 
the application of measurements, is that a measurand is compared agamst the 
defined standard, or something representing that standard, the difference from 
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the actual magnitude of the standard being expressed in subdivisions of the 
basic unit used according to some form of scaling. 

The standard is the physical object or characteristic of a physical apparatus 
that represents the conceptual unit chosen to represent a particular measurable 
attribute. For example, a particular piece of metal uniquely represents the unit 
of mass; here the unit is called, by convention, the kilogram. It is represented, 
physically at the primary level, by a sole piece of material maintained under 
defined and controlled conditions. This is the physical standard of the unit of 
measurement. 

Each unit and its standard are derived by man. The general philosophy 
adopted for the creation of the primary standards is that they be based upon 
some physical principle that is known to be as invariant as can possibly be found. 

In many cases the associated unit was defined before the principles now 
maintaining the standard were developed. For this reason many primary units 
appear to use very awkward values. For example, the metre is currently defined 
as 1,650,763.73 wavelengths of the radiation of krypton-86 gas established under 
defined, closely-controlled, conditions. This rather inconvenient number arises 
because the metre was formerly defined (in the same manner as the kilogram) 
by a metal bar of a given length, which, in turn, was based upon an unrealistic 
distance equal to one ten-millionth of the distance of a quadrant of the earth’s 
circumference. 

It is more expedient to retain any new standard’s value close to the magnitude 
of the unit adopted than it is to change the whole, traceable, chain of units 
whenever a new and better physical principle is discovered. At present the 
krypton length standard appears to be about to be replaced by the use of 
radiation from an iodine, or methane-stabilized helium-neon, continuous-wave, 
laser. Even this, however, may be overstepped by adopting the concept of 
defining length in terms of a chosen and declared value of the speed of light. 
This will then relate length to the physical standard of time instead of to its own 
unique primary standard principle. 

Most of the primary standards are now based upon natural physical principles 
but their units are still declared as man chooses. The values are refined as better 
knowledge is gained. In the past this has led to a bewildering range of units. 

The development of physical standards and their units has been a long and 
tortuous process (see Bendick, 1947; Kisch, 1965; Klein, 1975; Sydenham 
1979). Since earliest times rulers have realized the need for legal metrology but 
were generally unable to do very much about it beyond establishing the rules 
and standards: they could not adequately police adoption across large areas of 
land. National units of length and volume were implemented but it was not 
until comparatively recently that a coherent system of measurement units and 
their primary physical standards was introduced. This came in the form of the 
SI metric system, which is almost that sole legal system in use throughout the 
world. Bailey (1977) is a good introductory account about the various kinds of 
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Standard Sanders (1972) and Verman (1973) are two of the few published texts 
on the subject of standardization Other works of relevance are cited in the 
following section on standards of the speafication form 
The development and maintenance of the primary physical standards is a 
specialist area of measurement science It is one where the prime task is to 
develop maintain and apply apparatus that will provide definitions of the 
required base and derived units at the highest possible accuracy and reproduci 
bility Bell and Clarke (1975) on producing a standard for density (a seemingly 
easy variable) provide insight into the theoretical and practical levels of 
sophistication required Factors such as cost time to set up and make an 
observation size and portability of the apparatus ability to be mass produced 
for commercial sale and other factors important to the industrial user are of 
lesser significance in this case than ach eving the best possible metrological 
performance In each extreme the basic philosophy is much the same it is the 
emphasis that is different It takes many years to develop new standard apparatus 
and obtain agreement for its general use throughout the cooperating countries 
Figure 3 1 shows apparatus mainta ning the standard unit of time the second 
To maintain this standard is a measurement science task combined with 
fundamental physical research Page and Vigoureux (1975) provide ms ght m 
theiraccountof the first century of operation of the BIPM Pans theorganiza 



Figure 3 1 Phys cal apparatus used to ma nta n the primary frequency standard 
for SI un t the second n the Un ted States of Amer ca (Nat onal Bureau of Stan 
dards USA) 
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tion responsible for administration of the world’s legal units and standards for 
the SI system. The Bureau International des Poids et Mesures (International 
Bureau of Weights and Measures— BIPM) has been responsible for metric 
systems since the 1875 Convention du Metre was officially signed. 

Illustrated descriptions of the apparatus used to set up the primary standards 
of the international network are presented in Page and Vigoureux (1975); wall 
charts are available from the National Bureau of Standards, Washington, USA 
(NBS, 1976), and from the National Physical Laboratory, England (HMSO, 
1972). Cochrane (1966), in his official history of the National Bureau of 
Standards, also includes detail. Table 3.2 lists some national laboratories. The 
capabilities, which cannot be totally extensive, of the various national measure- 
ment laboratories are usually described in booklets published periodically, 
NML (1975) and NBS (1977), being examples. Such booklets often include 
lists of people and services that the various laboratories can provide. The 
laboratories usually offer more than standards maintenance, their specialist 
apparatus and staff being available for solving the more specialized and unusual 
measurement problems that the general industry cannot meet. The practice of 
standards and calibration is the subject of a recent series of articles by various 
authors, published in Measurement and Control (1978). Much of the work of the 
national physical standards group appears in the journal Metrologia. 

Nomenclatures for the physical standards have not yet achieved a single 
uniform terminology. Some guidance and historical background are to be found 
in McNish (1958), and Cochrane (1966), but the definitive sources are, of course, 
the various international and national glossaries issued on the subject of general 
terminology of metrology already mentioned in Section 3.2.1. 

Several classes of standard exist and confusion is commonplace about the 
terminology that should be applied. The following definitions are based upon 
the English language entries of the PD 6461 document referred to earlier. 

An international standard of measurement is that ‘recognized by an inter- 
national agreement to serve internationally as the basis for fixing the value of 
all other standards of the given quantity’. This may need to be practically 
compared with the sole global standard, such as in the case of the kilogram, or it 
may be capable of development at a national level, as is the case with the standard 
of length where many countries operate their own krypton interferometers. 

The national standard of a measurement is that ‘recognised by an official 
national decision as the basis for fixing the value, in a country, of all other 
standards of the given quantity’. Theoretically, and on occasion practically, this 
may not be the standard having the highest metrological performance in that 
country (which is termed the primary standard) but it usually is. It is quite 
possible for better apparatus to be developed than the defined standard but time 
must elapse before it can be legally instituted to replace that then existing as the 
national standard. Research establishments will often be able to claim better 
per ormance in terms of discrimination, repeatability, and reproducibility for 



62 


HANDBOOK OF MEASUREMENT SCIENCE 


Table 3 2 Names and location of selected national bodies responsible for legal national 
physical standards (Courtesy of National Measurement Laboratory Sydney) 


Country 


Name and location 


Argentina 


Australia 


Austria 


Belgium 


Brazil 


Bulgaria 


Cameroon 


Canada 


Chile 


The Director General 

Institute Argentine dc Racionalizacion de Materiales (IRAM) 
(Argentine Standards Institute) 

Chile Road 1192 
Buenos Aires Argentina 

The Director 

National Measurement Laboratory CSIRO 
PO Box 218 BradfieldRoad 
Lindfield NSW 2070 Australia 

The Director 

Bundesamt fur Eich und Vermessunswesen 

16 Arligasse 35 

1163 Wien (Vienna) Austria 

Ingenicur en Chef 

Directeur du Service Beige de la Meirologie 
24 26 rue J A De Mot 
B 1040 Bruxelles Belgium 

The Director General 

Instituto Nacional de Pesos e Medidas 

Rodovia Washington Luiz Quilometro 23 XPrem 

Municipio de Duque de Caxias Estado 

Rio de Janiero Brazil 

The Vice President 

Comite d Etat de Normalisation 

PO Box II 

lOOQ Sofia Bulgaria 

The Chief 

Service Central des Poids et Mesures 
Ministere de 1 Economic et du Plan 
Boite Postal 493 
Douala Cameroon 

The Director 
Division of Physics 
National Research Council 
Ottawa KIA OR6 Canada 

The Chief 

Division ofMetroIoey 

Instituto Nacional de Normahzacion 

Casilla 995 

Correo 1 

Santiago Chile 
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Table 3 2 {continued) 


Country 

Name and location 

Czcchoslavakia 

The Vice President 

Urad pro Normalizaci a Mereni 

Vaelavske namesti c 19 

11347 Praha (Prague) 1, Nove Mesto 

Czcchoslavakia 

Denmark 

The Director 

Justervaesenet (Bureau of Weights and Measures) 

Amager Boulevard 115 

DK-2300 

Copenhagen, Denmark 

Cuba 

The Director 

Centre de Recherches Metrologiques 

Comite Estatal de Normahzacion 

5 ta 306 e/CyD Vedado Habana, 4 

Cuba 

Egypt 

The Director 

National Institute for Standards 

National Research Centre 
al-Tahnr Street, Dokki 

Cairo, Egypt 

Cyprus 

Senior Officer 

Research and Industrial Development 

Ministry of Commerce and Energy 

Nicosia, Cyprus 

Finland 

The Director-General 

Valtion teknillinen tutkimuskeskus 
(Technical Research Centre of Finland) 

Vuonmiehentie 5 

02150 Espoo 15, Finland 

r ranee 

President 

Comite de direction 

Bureau National de Metrologie 

8-10, rue Crillon 

75194- Pans Cedex 04, France 

German Democratic 

The Vice President 

Republic 

Amt fur Standardisierung, Messwesen und Warenprufung 
Hauptabteilung Gesetzhche Metrologie 

Wallstrasse 16 

1026 Berlin, German Democratic Republic 

Federal Republic of 

The President 

Germany 

Physikahsch-Techmsche Bundesanstalt 

Bundesallce 100 

33 Braunschweig, Federal Republic of Germany 

(tonlinued) 
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Table 3 2 {conimued) 


Couniry 

Name and location 

Hungary 

President 

Orszagos Meresugyi ffivafal 

Nemet'olgyi ut 37/39 

Budapest Xll, Hungary 

India 

The Director 

National Physical Laboratory 

Hillside Road 

New Delhi 12 India 

Indonesia 

The Chief 

Service of Nfeirology 

Departemen Perdagangan 

Direktorat Metfologi Standardisasi and Normalisasi 

D/alan Pasteur 27 

Bandung Indonesia 

Iran 

Director General 

Institute of Standards and Industrial Research 

Ministry of Industries and Mines 

PO Box 2937 

Teheran Iran 

Ireland 

The Chief 

Department of Physics 

Institute for Industrial Research and Standards 

Ballyntun Road 

Dublin 9, Ireland 

Italy 

The Chief 

Uffiao CeniraJe Mefrrco 

Via Antonio Bosio 15 

00161 Rome, Italy 

Japan 

The Director 

National Research Laboratory of Metrology 

10-4 1 Chome. Kaga. Itabachi ku 

Tokyo Japan 

Republic of Korea 

The Director 

National Industnal Standards Research Institute 

199 Dongsoongdong Chongno ku 

Seoul, Republic of Korea 

Mexico 

The Director General 

ConsQO Naaonal de Ciencia y Tecnologia 

Insurgentes Sur 1677 

Mexico 20 DF, Mexico 
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Table 3,2 {continued) 

Country Name and location 

Netherlands The Director 

Institute of Theoretical Physics 
Universiteit van Amsterdam 
Spui 21 

Amsterdam, The Netherlands 

Norway The Director 

Justerdirektoratet 

(National Bureau of Weights and Measures) 

Postbox 6832 
St Olavs Plass 
Oslo 1 , Norway 

Pakistan The Director 

Pakistan Standards Institution 
39 Garden Road, Saddar 
Karachi 3, Pakistan 

Poland The President 

Polski Komitet Normalizacji i Miar 
ul. Elektoralna 2 
00-139 Warszawa, Poland 

Portugal Director of Quality 

Secretariat de Estado da Industria Pesada 
Direkao Geral dos Services Industrias 
Industrial Rua Jose Estevao, 83-A 
Lisboa 1, Portugal 

Romania The Director 

Institulul National de Metrologie 
Sos. Vitan-Birzesti No. 11 
Bucharest 5, Romania 

Spain The Secretary 

Comision Naciona! de Metrologia y Metrotecnica 
3 callc del General Ibanez Ibero 
Madrid 3, Spain 

South Africa The Director 

National Physical Research Laboratory 
Council for Scientific and Industrial Research 
PO Box 395 

Pretoria 0001. South Africa 

Sweden The Director General 

Statens Provningsansialt 
PO Box 857 
S-501 15 Boras 
Stockholm. Sxs'eden 


{continued) 
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Table 3 2 (continued) 


Country Name and location 


Switzerland The Director 

Office Tederal de'Metrologie 
Lindenweg 50 

3084 Wabcm/Be, Switzerland 

Thailand The Director 

Division des Poids el Mesures 
Ministcres des Affaires Economiques 
Bangkok, Thailand 

Turkey The Director 

Service des Mesures et des Etalons ct Mmistere Commerce 

Ticaret Bakanligi 

Okuler ve Ayariar Mudur Vekili 

Bakanliklar. Ankara, Turkey 

USSR The Chief 

Gosstandarl 
Lemnsky Prospect 9 
Moscow M 7049 USSR 

United Kingdom The Director 

biaUAwa!. PiWis.<iaJ, 

Teddmgton, Middlesex TWIJ OLW 
UK 


United States of 
America 


Uruguay 


Venezuela 


Yugoslavia 


The Director 

National Bureau of Standards 
Washington DC 20234, USA 

The President 

Insfituto Uniguayo de Mormas Tecnicas 
Avda Agraciada 1464 
Montevideo, Uruguay 

Metrologiste en Chef 

Servico Nacional de Metrologia Legal 

Ministeiio dc Fomento 

Av Javier Ustanz 

Edif Parque Residenaal 

Urb San Bernardino 

Caracas, Voiezuela 

The Director 

Sawzm'z2fsD62aVlereiT5ragocene'Meta‘ie 
Mike Alasa 14 
1 1 000 Beograd, Yugoslavia 
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some of their equipment and they may need to adopt it as their own internal 
standard but that does not make it the legal standard. Carefully controlled 
scientific and legal processes must be used to monitor any newly proposed 
apparatus to ensure that a change is certainly for the better in the long term. 
For example, proposals for the adoption of an absorption-stabilized laser as the 
new length international standard have been under observation for many 
years. The term prototype, which was once used to denote an international 
standard, is now deprecated, except in a historical context. 

A secondary standard is that arrived at by some form of comparison with the 
primary standard or reference standard (see below). The term is used to describe 
either a subsidiary or a hierarchical place in the traceable chain. Thus its use 
can be somewhat confusing for the term does not inherently imply anything 
about its quality compared with the primary standard. It is not necessarily 
inferior. The term substandard is not included in the PD 6461 document and, 
therefore, should presumably be avoided in use. 

Obviously only one location can have the primary standard so practical 
requirements dictate the use of a local sole standard to which all others in the 
locality are made traceable. This secondary standard is called the reference 
standard. The reference standard must also be carefully maintained in controlled 
conditions of ambient and use so as to preserve its calibration. From the reference 
standard it is necessary to form other standards that can be used at the more 
hazardous (for the instrument) working face. These are called working standards, 
sometimes also referred to as field standards. 

Natural processes are often used to standardize measurements, the units 
being defined as a man-made arbitrary decision. The use of a radiation wave- 
length has already been mentioned. This form of standard is given the name 
reference value standard. Another example is the use of certain chemico-physical 
points to give the international practical temperature scale (IPTS). These enable 
a standard to be established without the need to compare it with a unique 
defined apparatus. As an example the National Bureau of Standards has declared 

that one brand of marketed laser interferometer suffices as a reference standard 
for length. 

Breaking down a measurement situation, wherein numerous interconnected 
attributes are needed to define just one quality, into the appropriate quantities 
existing in the defined SI system is often prohibitive. Examples are to define a 
Stan ard for a particular grade of iron, for the various chemicals, and for 
Stan ard pollution levels. An alternative available is to use reference materials 
(not defined in the PD 6461 document). These are materials, or substances, that 
are officially recognized as a standard of certain attributes. They are charac- 
enz by having a high degree of stability of one or more of their chemical, 
Pijsicai or other metrological properties. They are called standard reference 
materials (SRM’s) in the United States (where they largely originated), and 
TTrri/ierf reference materials in some other countries. 
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j^gfej-gnce materials may be consumed m the measurement process or they 
may be reusable after periodic calibration They are also often used to conduct a 
quality audit of a group of cooperating standards laboratories by requesting 
each to measure the same SRM and declare their results for overall scrutiny 
Audits, also called round robins lead to better standard specifications as they 
give realistic answers to the real capability of that industry As an example, a set 
of variously sized objects was circulated for stated dimensional quantities to be 
determined The results were studied and a new revised standard for tolerancmg 
was eventually issued 

Within a firm making many parts of the same size and form it is common 
practice to hold samples of the product that are the firm's local reference It is 
more expedient to use these than to laboriously measure each individual 
parameter, one at a time The samples act as standards for comparison 
Other forms of reference material are those pure substances used to define 
such units as density, the IPTS triple points, and the krypton used m the length 
interferometer The purity and the properties of the substance are used to 
maintain the value of a defined quantity 
The introduction of what can be properly called universal systems of 
physical units and their standards began m earnest m the 18 th century with the 
proposal of the French to use a metric numerical basis and a given set of primary 
units The metric SI system evolved from this after many years of partial 
adoption, change, consideration and (occasionally) inaction 
Units may arise inherently following the adoption of a freely available 
natural standard, such as the use of seeds from plants or the length of the human 
foot The unit is given a name that describes the standard, in these cases it was 
the carat and the foot respectively Once units have been established in this way 
It remains for them to be refined, adopting better standards (which ultimately 
are not freely available) as they are devised In the early days of laser develop- 
ment a commonly used standard of power output was the number of Gillette 
razor blades that could be vaporized through This was a very convenient form 
of standard, the units being as so many gillettes However, to allow standards to 
arise in this uncontrolled way leads to confusion and lack of accuracy with its 
inherent lack of interchangeability of results who can be certain that razor 
blades of all manufacturers would have the same fusing properties ' History has 
well shown that units and standards need careful and tight control if the com- 
munity IS to get the best rate of progress from them The need for new units and 
standards arises continuously as fundamental research discovers new effects 
When this happens those concerned should act responsibly giving careful 
thought to the units and standards they adopt These should be traceable within 
the SI system and be adopted after agreement by workers in similar fields 
Unfortunately progress must first be made using interim standards which will 
provide some control whilst clear realization of the units involved is allowed to 
emerge 
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The magnitude of the unit adopted is an important consideration for many 
reasons. Too small a unit can result in large numbers or much subdivision, the 
opposite is clearly also inefficient. For this reason the scale of units has tended 
to be matched to the task. The angstrom is a convenient magnitude for describing 
atomic dimensions, the light-year is suited to galactic space distances. However, 
the practice of adopting units for the sake of convenience alone leads to multi- 
plicity of units, the result being that the need for conversion becomes common- 
place with its inherent potential for error. The SI system comes close to this ideal 
using decades of thousands and unity units. The kilogram is, however, an 
important incorrect departure from the fundamental philosophy for it is not of 
unit value. 

The now obsolete British Imperial, foot-pound-second, system resulted from 
centuries of chaotic development in which expediency of the local needs seems 
to have been the main criterion adopted. Early metric systems used a centi- 
metre-gram-second (CGS) basis; the MKSA system used the metre-kilogram- 
second-ampere basis. The Americans adopted the Imperial system in use at the 
time of their colonization and today still retain certain units as they were in those 
times— this is the reason why the US gallon is smaller than the now obsolete 
British gallon. 

The system of units used affects trade probably more than any other sector of 
the developed country. It was the Conference Generale des Poids et Measures 
(CGPM) that, in 1954, agreed to a single uniform and very comprehensive 
system based upon the various forms of the metric system then in use. After 
1954, the BIPM, mentioned earlier, yet again became the centre for revision of 
the metric system in use. In 1954 it was generally agreed to change from the 
several systems in use to the ‘Practical system of units’ which added thermo- 
dynamic, temperature, and luminous intensity units to the list of base units. In 
1960 it became known as the Systeme International d’Unites or just SI. In 1971 
the chemical unit of substance, the mole, was added to the already agreed base 
units to give the present seven base units. 

Although the SI system provides a framework for total uniformity between 
all nation’s systems of units, it is not adhered to in every aspect. Each country 
has usually retained a few of its formerly used units, the nautical ‘knot’ being 
one example. For this reason it is necessary to consult the national standard 
statement, not the prime international SI document. Table 3.3 lists some of the 
major standards documents issued on use of the SI metric system. 

From the base units of the SI system it is possible to derive most other units 
that are ever needed by the dimensionally appropriate multiplication and 
division of the base units. The metric standards also contain statements about 
the preferred routes to obtain derived units. Figure 3.2 gives a chart showing the 
base units and derived units having special names which are obtained by appro- 
priate combination. This arrangement allows calibration of a derived unit in a 
way that is traceable to the primary standards apparatus and that will, if users 
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Table 3 3 Selected SI metric units standards documents (See Tables 3 4 and 3 5 for 
explanation of the code letters) 


Documenl reference 

Title 

ISO 1000 1973 

SI umls and recommendations for the use of their mull pies 
and of certain other units 

BS 3763 1973 

The International System of Units (SI) 

DS 5555 

SI units and recommendation for the use of their multiples 
and certain other units (endorsed ISO 1000 1973) 

BS PD 5686 

The use of SI units 

NBSSP 330 1977 

ANS1Z210 1 1976 ] 

and 

The international system of units (SI) (translation approved 
by BIPM of the publication entitled Le Systeme 
Inicrnational d Un tes) 

ANSl/ASIM E380 1976 
and 

ANSI IEEE 268 1976 J 

American national standard metric practice (conversion) 

ASTM E380 1970 

Standard metric practice guide (a guide to the use of 

SI the International System of Units) 

DIN 1301 Tell 2 

Units sub multiples and multiples for general use 

AS 1000 1974 

See also 

The international system of units (SI) and us applicatioti 

ANSI 

Metric package 

ANSI 

A bibliography of metric standards— SPlIb 

American National Standards Institute New York 

BIPM 1 

Le Sysieme Iniemaiional d Unties International Bureau 
, of Weights and Measures (BIPM) and the authorized 

NPL 

English language versions from the National Physical 

NBS J 

Laboratory UK and National Bureau of Standards USA 


abide by the directites issued be calibrated by the same path in all places 
KnotWedge of the theory and practice of dimensional analysis is needed for (his 
ojieration The variable must first be identified in its dimensional form which 
then leads to paths to follow to combine units For example to obtain calibra 
tion for acceleration transducers correctly requires a system of calibrations that 
derives velocity from the metre and the second combined with the second again 
to obtain acceleration Texts on dimensional analysis are by EsnauU Pellene 
(1950) Massey (1971) and Pankhurst (1964) 

Due to the rapid upsurge in adoption of the metric system in recent times 
many texts and standards about the metric system have appeared, examples 
include Bradshaw (1975) ChisweHandGngg(l971) Feirer(1977) Metrication 
BoarrT UK (1971) and O Neill (1976) National standards organizations issue 
defining documents for use of SI (see Table 3 3> 
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DoT!»d tin** Ind)C*t* diviiion 
Unbrok«n I net In^icvt* multipitcition. 


Figure 3.2 Hierarchy of SI base and derived physical measurement units having special 
names (Courtesy of Metric Conversion Board, Australia (1972), after chart in Metric 
Handbook: SAA MHI-1972, Standards Assoc, of Australia, Sydney) 
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A number of books have given explanations of the development of systems of 
standards (see Ellis, 1973, Klein, 1975, Dellow, 1970, Page and Vigoureux, 
1975 Further references are available in Sydenham (1979) 

Quantities in general, along with their units and symbols are covered in the 
periodically revised work issued by the Royal Society (for the latest, see Royal 
Society, 1975) This, however, is not the sole source of symbols, those used vary 
widely (see Lowe, 1975) It is generally safer to state the system of symbols used 
when publishing documents for there is a very real chance that the same symbol 
will represent several quantities, especially in instrumentation which transcends 
many boundaries Although they are not likely to come together m the disciplines 
they arose in, they often do m measurement applications Authors should be 
at pains to state the symbols used giving reference to the definitive source or to 
their own usage Further references to published standards and related material 
on terminology for the individual subjects of measurement science are to be 
found in the respective chapters of this work 
The introduction of the metric system has probably never caused more debate 
and recorded statement than was the case in the United States of America The 
National Bureau of Standards (NBS) has issued reports on the debate (Simone 
and Treat, J97i. Simone, 197j) Verman and Kaul (1970) record Indias 
reaction to the change to the metric system 
Byinternational agreement thedeveloped countries have gradually introduced 
into their public service sectors institutions that establish and maintain the 
primary standards for the base units To a varying degree, they also provide 
traceability for the denved standards Several have already been mentioned 
Table 3 2 lists the places concerned for many countries The operation of these 
institutions is not standardized and they each play slightly different roles in 
their countries’ affairs They do, however, maintain physical standards to agreed 
procedures Penodically they mtercompare their individual results with each 
other in order to decide the uncertainty figures that should be assigned to the 
declared standards 

Not all countries using the metric system have the capacity to maintain base 
unit standards Those that do not often have to rely upon the assistance of other 
nations They generally begin by establishing standards for trade reasons, 
setting up legal weights and measures control and publishing standard speci- 
fications, traceable, higher levels of standardization for physical units are then 
added as time passes A considerable number of the developing nations are still 
without adequate local standards apparatus 
United Nations programmes, such as those implemented through UNIDO, 
UNDP, and other aid schemes, are gradually implementing a range of bilateral 
arrangements which will progressively spread knowledge and capability to 
maintain traceable calibrations chains throughout the world Additions during 
the 1970’s include the Korean Standards Research Institute (KSRI) with its 
network and the Brazilian system of weights and measures 
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Eventually it is to be presumed that all countries will be party to the inter- 
national agreement of the metre under control of the work of the BIPM, and that 
most will possess their own base unit facilities where they have a large enough 
population to warrant a complete traceable system. However, there is a long 
way to go in this regard (the cubit is still used in the highlands of an island in the 
Asian region), but already considerable progress has been made since the forma- 
tion of the UN in 1945. 

It must be stressed that the use of uncalibrated, non-traceable standards is to 
be discouraged, especially when the work is likely to spread into international 
cooperation at high levels of standardization. The uncalibrated instrument, and 
even worse the instrument that cannot be calibrated, are likely to cause con- 
siderable inefficiency and rework of research or of manufactured parts, work 
that all too often has to be carried out without the reasons for doing so being 
properly identified and the cause corrected. A balance must, however, be struck 
between the calibration level and periodicity employed with the task in hand; 
calibration is not without its cost. Sydenham (1978) provides a summary of the 
things that a calibration facility operator might need to know about in operating 
such a facility. 


3.5 STANDARDS OF SPECIFICATION 

The standards referred to above are concerned with definition of the physical 
units such as length, mass, time, velocity, pressure, temperature, density, and 
so on. They exist as physical apparatus and form a class of standard. A second, 
much larger group of standards— they are generated at around 2000 per month 
across the world (BSI, 1978); the US alone has around 600 organizations 
producing them (Slattery, 1971, 1972)— comprises those issued, as documents, 
to define such matters as terminology, methodology, testing procedures, 
tolerances, analyses of materials, safe practices, classification schemes for 
products, product performance, and other characteristics. These are generally 
termed either standard specifications, commercial standards, industrial standards 
or technical standards. The Standards Council of Canada, who operate the 
National Standards System of Canada, officially defines this form of standard 
(SCC, 1976), as ‘approved rules for an orderly approach to a specific activity’. 
There does not appear to be a formal generally accepted definition in existence 
that distinguishes this form of standard from those of the measurement units 
discussed in Section 3.4. 

These pertain, directly and indirectly, to the quality and performance of 
products and services. They began as standardization for industry spreading 
later into consumer products. They occur as codes, rules, regulations, and 
specifications to give control to the supply, erection, use, and operation of 
articles, materials, and equipment of all kind. They may pertain to raw materials, 
components, subassemblies, finished products, safety, design, construction. 
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testing, and quality of performance If it can be measured or related to measure- 
ment subjects It could be the subject of a standard specification 

Some of the standards issued contain only subjective measurement statements, 
they do not necessarily relate to physical measurement of parameters of their 
subjects The name standard is, however, likely to be confusing Indeed it has 
caused considerable confusion with physical struidards The National Standards 
Laboratory, Sydney, changed its name to the National Measurement Labora- 
tory (NML) largely because of this its functions were being confused with 
those of the Standards Association of Australia (SAA) which authorizes the 
standard specifications 

Standard specifications generally deal with standardization of terms and 
definitions, creation of standards of design or performance (which helps the 
tendering process by providing standardized statements for writing into con- 
tractual agreements) with the prescription of standards of quality and the 
associated tests by which to decide the quality, with rationalization of sizes and 
levels of quality to keep the options down to an efficient minimum number, and 
with the control of dimensions for components so that they can be reliably 
interchanged 

England is the acknowledged birthplace of this form of standard— the first 
such case arose with the standard thread that Whitworth proposed m 1840 
The British Standards Institution (BSI) was formed in 1901 Standards were 
progressively evolved by the work of many interested parties, they are not issued 
solely by civil service action They are not in themselves all necessarily legal 
identities but many are required by relevant authorities, such as electric power 
supply companies, to be adhered to if a product is to be sold or used m a co- 
operating situation Insurance companies, learned institutions, industrial 
consortia, government agencies, statutory authorities, private firms, and the 
public at large can each have a hand m establishing the consensus needed for 
the generation of a standard specification Specifications may be issued by a 
national institution and also by professional bodies, the BSI and the American 
Society of Mechanical Engineers (ASME) being examples of each The majority 
of such specifications are adhered to voluntarily, but some are enforced 
compulsorily 

The first stage in preparation of a standard specification is for a responsible, 
appropriate, and authorized body to study the case put to it to establish if the 
need for a specification is worthy The various organizations usually have 
established committees or councils responsible for defined areas The need for 
a new standard, or for a revision, is generated by a request arising from any 
source or from the members of the comimttees who may sense the need from 
their intimate knowledge of the area they represent 

A conference will then be called of relevant organizations who will use the 
eventual standard If this conference agrees to the need it decides the terms of 
reference and the task is put into the hands of the relevant drafting committee 
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The drafting committee acts in a largely autonomous manner proceeding with 
the task of preparing a draft standard or draft proposal (DP) for public review 
and comment. This might be done through subcommittees if the task is large. 

The draft document is publically circulated for several months to give knowl- 
edgeable persons and bodies a chance to prepare a submission of their views 
about the draft proposal which they put to the committee. These views are taken 
into consideration by the drafting committee who finalize the document and 
then submit it to their authorizing body who scrutinize the whole operation. 
Finally, if satisfied as the result of a voting procedure, the latter issues the final 
draft as the declared standard. Creating the international organization stan- 
dards requires more stages and is usually done by correspondence rather than 
by face-to-face meetings. 

The material contained in the standard can be brought together by many 
means. It might be adapted from an already existing standard of the two main 
international organizations, ISO and lEC, or be from another country. If it is 
identical with the already existing standard it is declared as being endorsed by the 
next user who issues a code number in line with the local numbering system. 
Another way to obtain the content is from a round robin procedure that decides 
what is a reasonable and established good practice to write into the standard. 
Some standards are produced from the work of the committee, the members of 
which put together their experience together with solicited information. Staff 
of the national standards body may assist. A trade body may provide a basis 
for the draft. In short, any competent person or body can participate in this way. 

It is important to realize that standard specifications are not statements made 
for all time. They are always subject to review so that they will remain an accurate 
statement of contemporary workable practice. Some date faster than others. 
Threads, for example, have been reviewed at only roughly twenty-year periods 
whilst electric lamps needed review at some two-yearly intervals in their early 
years of issue. 

A national standards association generally has an official logo as its mark of 
conformity. Only on products and the like that meet the standards, and are 
tested to do so by an appropriate authority, can the official logo be used. 

Identifying products and causing them to comply with standard specifica- 
tions can be a very valuable method of stabilizing quality and performance. It 
also greatly reduces contractual statements and engineering specification 
writing. It leads to efficiency in manufacture and use by aiding stability and 
rationalization. However, a note of caution must be sounded that the use of 
standard specifications does have disadvantages. Lack of detailed compliance 
to a given standard can be used to provide protection for an industry within a 
country from imports. It also slows up the release of products onto the market. 
Designs and contract writers must also keep themselves up to date with re- 
visions. Singular, international agreement and endorsement of all standards by 
all countries will, theoretically, eventually lead to common codes but this seems 
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rather an impossible hope m view of the number of standard specifications that 
are in existence 

For standard specifications to be effective they must be freely available for 
consultation and purchase and there must be a suitable organization to handle 
the administration of sales and updates, advice and standards draftmg Tech- 
nologically based countries generally have a government agency to do this 
Table 3 4 gives a list of national level standards organizations, note the existence 
of the tno mtemational bodies ISO and lEC Burton (1976), Mason and Peiser 
(1971) Ollner (1974), SAA. (1977), Sanders (1972), Stewart (1977), and Verman 
and Kaul (1970) each provide insight into standardization and its procedures 

Specific area standards are mentioned in the respective chapters given m this 
handbook There are too many standards on measurement-related subjects for 
them to be given here those in the well known standards organizations lists are 
relatively easy to locate especially if a standards library can be used A complete 
set of national standards and some of other organizations is usually to be found 
m the offices of the national body but it is unrealistic to expect that all standards 
known can be found in a single location The BSI library receives as complete a 
set as can probably be found The usual approach to the problem is to collect, 
through purchase, those that relate to one’s specific area of endeavour 

Three useful and relevant documents to measunng systems, not m those 
systems of publications, are a guide to purchase procurement of complex 
electronic and supervisory systems (CAMA, 1974), an in-house publication 
comparing the symbols used in electncal systems for m the United Kingdom, 
United States of America, Canada, Germany, and as stated by the International 
Electrotechnical Commission (TEC) (Neumuller, 1973), and the Instrument 
Society of Amenca (ISA) publication of standards and practices for instru- 
mentation (ISA, 1977) 

At the mtemational level of standardization there exist two complementary 
organizations ISO and lEC ISO, the International Organization for Stan- 
dardization, had m 1978, 68 members, 17 corresponding members and it liaised 
then with over 350 organizations By 1978 ISO had produced 3750 ISO 
standards documents, these being the result of the work of 100, 0(X) experts 
contnbutmg through 1940 technical bodies A general guide is issued (ISO, 
1979a) along with details of the technical programme (ISO, 1979b) and an 
annual catalogue (ISO, 1979c) Liaisons are officially hstedm ISO (1977a) ISO 
(1978) provides definition of the terms used in processes of standardization and 
certification A senes entitled ‘Bibliography’ is published as guides to general 
material of permanent nature ISO (1977b) is concerned with standards for 
documentation and terminology 

With 68 member countnes, some of which internally have several hundred 
standardizmg agencies, it is clear that the task of locating the existence and 
whereabouts of a standard on any particular subject can only be attempted 
efficiently by computer-based procedures In 1975 ISO decided to create a 
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Table 3,4 Standards organizations having membership of ISO (from ISO, 1979a). (Code 
letters denote organization name not standards code letters, which are given in Table 3.5) 


Inteniatiofwl 

International Organization for 
Standardization (ISO) 

1, rue de Varembe 
Case postale 56 
CH-1211 Geneve 20 
Switzerland 

National 
Albania (BSA) 

Byroja e Standarteve 

Prane Komisionit te Planit te shtetit 

Tirane 

Australia (SAA) 

Standards Association of Australia 
Standards House 
80-86 Arthur Street 
North Sydney, NSW 2060 

Bangladesh (BDSI) 

Bangladesh Standards Institution 
3-DIT (Extension) Avenue 
Motijheel Commercial Area 
Dacca 2 

Brazil (ABNT) 

Associa?ao Brasileira de Normas Tecnicas 
Av. 13 de Maio, n° 13-28° andar 
Caixa Postal 1680 
CEP; 20. 000, Rio de Janeiro 

Canada (SCC) 

Standards Council of Canada 
International Standardization Branch 
Meadowvale Corporate Centre 
2000 Argentina Road, Suite 2-401 
Mississauga, Ontario 
L5N 1V8 

China (CAS) 

China Association for Standardization 

PO Box 820 

Peking 


International Electrotechnical 
Commission (lEC) 

1, rue de Varembe 
Case postale 56 
CH-1211 Geneve 20 
Switzerland 


Algeria (INAPI) 

Institut algerien de normalisation 
et de propriete industrielle 
5, rue Abou Hamou Moussa 
BP 1021, Centre de tri, Algiers 

Austria (ON) 

Osterreichisches Normungsinstitut 
Leopoldsgasse 4 
Postfach 130 
A- 1021 Vienna 2 

Belgium (I BN) 

Institut beige de normalisation 
Av. de la Brabangonne, 29 
B-1040 Brusseles 

Bulgaria (DKC) 

State Committee for Standardization 
at the Council of Ministers 
21, 6th September Str. 

Sofia 

Chile (INN) 

Instituto Nacional de Normalizacion 
Matias Cousino 64-6° piso 
Casilla 995, Correo 1 
Santiago 


Colombia (ICONTEC) 

Instituto Colombiano de Normas 
Tecnicas 

Carrera 37 No. 52-95 
PO Box 14237, Bogota 


{continued) 
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Table 3 4 (conimiied) 


Cuba (NC) 

Comite Estatal de Normalizacion 
5ta nr 306 entre c y d vedado 
Zona postal 4 
Havana 

Czecliosloiakia (CSN) 

Ufad pro normalizaci a mefeni 
Vaclavske natnesii 19 
113 47 Prague 1 

Egipi Arab Republic of (EOS) 

Egyptian Organization for Standardization 
2 Latin American Street 
Garden City 
Cairo, Egypt 

Finland (SFS) 

Suomen Standardisoimisliitto r y 
PO Box 205 
SF^121 Helsinki 12 


German}, FR (DIN) 

DIN Deutsches Institut fur Nomiung 
Burggrafenstrasse 4 10 
Postfach 1107 
D-1000 Berlin 30 

Greece (ELOT) 

Hellenic Organization for Standardization 
Didotou 15 
Athens 144 


India (ISI) 

Indian Standards Institution 
Manak Bhavan 
9 Bahadur Shah 2^faf Marg 
New Delhi 110002 


Iran (ISIRI) 

Institute of Standards and 
Industrial Research of Iran 
Ministry of Industries and Mines 
PO Box 2937 
Teheran 


Cyprus (CYS) 

Cyprus Organization for Standards 
and Control of Quality 
Ministry ol Commerce and Industry 
Nicosia 

Denmark (DS) 

Dansk Standardisenngsraad 
Aurehovej 12 
Postbox 77 
DK-2900 Hellcrup 
Ethiopia (ESI) 

Ethiopian Standards Institution 
PO Box 2310 
Addis Ababa 


frowe (AFNOR) 

Association franeaise de normalisation 
Tour Europe 
Cedex 7 

92080 Pans La Defense 
Ghana (GSB) 

Ghana Standards Board 
PO Box M 245 
Accra 


Hungary (MSZH) 

Magyar Szabvanyugyi Hivalal 

Budapest 

Pf 24 

1450 

Indonesia (YDNI) 

Yayasan Dana Normalisasi Indonesia 
Indonesian Institute of Sciences 
Jalan Teuku Chik Ditiro No 43 
PO Box 250 
Djakarta 

Iraq (lOS) 

Iraqi Organization for Standards 
Planning Board 
PO Box 13032 
Baghdad 
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Tabic 3.4 (cotiiimied) 


Ire/am/ (URS) 

Institute for Industrial Research and 
Standards 
Ballymun Road 
Dublin 9 

Italy (UNI) 

Ente Nazionale Italiano di Unificazionc 
Piazza Armando Diaz 2 
I 20123 Milan 

Jamaica (JBS) 

Jamaican Bureau of Standards 
6 Winchester Road 
PO Box 1 13 
Kingston 10 


AVHra(KEBS) 

Kenya Bureau of Standards 
PO Box 54974 
NHC House 
Harambee Avenue 
Nairobi 


Korea, Republic of (KBS) 

Bureau of Standards 

Industrial Advancement Administration 

Yongdeungpo-Dong 

Yongdeungpo-Ku 

Seoul 

Libyan Arab Jamahiriya (LYSSO) 

Libyan Standards and Specifications 
Office 

Department of Industrial Organization 

Secretariat of Industry 

Tripoli 


Israel (Sll) 

Standards Institution of Israel 
42 University Street 
Tel Aviv 69977 


Ivory Coast (BIN) 

Bureau ivoirien de normalisation 

BP 1318 

Abidjan 

Japan (JISC) 

Japanese Industrial Standards 
Committee 

c/o International Standards Office 
Standards Department, AIST 
Ministry of International Trade and 
Industry 

33rd Mori Bldg 3-8-21, Toranomon 

Minato-ku 

Tokyo 105 

Korea, People's Democratic 
Republic of (CSK) 

Committee for Standardization of the 
Democratic People’s Republic of 
Korea 

Committee of the Science and 
Technology of the State 
Sosong guyok Ryonmod dong 
P’yongyang 

Lebanon (LIBNOR) 

Institut libanais de normalisation 

BP 195144 

Beyrout 

Malaysia (SI RIM) 

Standards and Industrial Research 
Institute of Malaysia 
Lot 10810, Phase 3, Federal Highway 
PO Box 35, Shah Alam 
Selangor 


{continued) 
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Table 3 4 (continued) 


Mexico (DGN) 

Direccion General de Nonnas 
Tuxpan No 2 
Mex:co 7. DF 


Netherlands (NNI) 

Nederlands Normalisatie instituut 
Polakweg 5 
PO Box 5810 
228Q HV Ryswyk ZH 

Nigeria (NSO) 

Nigerian Standards Organisation 
Federal Ministry of Industries 
1 1 Kofo Abayomi Road 
Victoria Island 
Lagos 

Pakistan (PSI) 

Pakistan Standards Institution 
39 Garden Road, Saddar 
Karachi 3 


Philippines (PS) 

Philippines Bureau of Standards 

TML Bldg 

100 Quezon Avenue 

Quezon City, Metro Manila 

PO Box 3719 

Manila 

Portugal (DGQ) 

Direccao Geral da Qualidade 
RepartifSo de Normaliza^ao 
Rua Jose Estevao, 83-A 
Lisbon 1 

Saudi Arabia (SASO) 

Saudi Arabian Standards Organization 
Airport Street 
PO Box 3437 
Riyad 


Morocco (SNIMA) 

Service de normalisation industrielle 
marocaine 

Direction de I Industrie 
Ministere du Commerce, de hndustrie, 
dcs mines et de la marine marchande 
Rabat 

Neu Zealand (SNUZ) 

Standards Association of New Zealand 

Private Bag 

Wellington 


Norway (NSF) 

Norges Standardisenngsforbund 
Haakon VII s gate 2 
N Oslo 1 


Peru (ITINTEC) 

Instituto de Investigacion Tecnologica 
Industrial y de Normas Tecnicas 
Jr Morelli— 2da cuadra 
Urbaniiacton San Borja— SutquiUo 
Lima 34 

Potend (PKNiM) 

Polski Komitel Normahzacji i Miar 
Ul Elektoralna 2 
00-139 Warsaw 


Romania (IRS) 

Institutal Roman de Standardizare 
Casula Portals 63-87 
Bucharest ] 


Singapore (SISIR) 

Singapore Institute of Standards and 
Industrial Research 
179, River Valley Road 
PO Box 2611 
Singapore 6 




STANDARDIZATION OF MEASUREMENT FUNDAMENTALS AND PRACTICES 


81 


Table 3.4 {continued) 


South Africa, Republic o/(SABS) 

South African Bureau of Standards 

Private Bag XI 91 

Pretoria 


Sri Lanka (BCS) 

Bureau of Ceylon Standards 
53 Dharmapala Mawatha 
Colombo 3 


Sweden (SIS) 

SIS— Standardiseringskommissionen i 
Sverige 

Tegnergatan 1 1 
Box 3 295 

S— 103 66 Stockholm 
77!ai7W(TISI) 

Thai Industrial Standards Institute 
Department of Science 
Ministry of Industry 
Rama VI Street 
Bangkok 4 

United Kingdom (BSI) 

British Standards Institution 
2 Park Street 
London W1 A 2BS 

USSR (GOST) 

USSR State Committee for Standards 
Leninsky Prospekt 9 
Moscow 117049 


Vietnam Socialist Republic of (TCVN) 

Department de normalisation 
Comite d’Etat des sciences et techniques 
39, rue Tran Hung Dao 
Ho Chi Min City 


Spain (IRANOR) 

Institute Nacional de Racionalizacion y 

Normalizacion 

Serrano 150 

Madrid 6 

Sudan (SSD) 

Standards and Quality Control 
Department 
Ministry of Industry 
PO Box 2184 
Khartoum 

Switzerland (SNV) 

Association suisse de normalisation 
Kirchenweg 4 
Postfach 
8032 Zurich 


Turkey (TSE) 

Tiirk Standardlari Enstitiisu 
Necatibey Cad. 1 12 
Bakanliklar 
Ankara 


USA (ANSI) 

American National Standards Institute 
1430 Broadway 
New York, NY 10018 

Venezuela (COVENIN) 

Comision Venezolana de Normas 
Industriales 
Av. Boyaca (Cota Mil) 

Edf. Fundacion La Salle, 5° piso 
Caracas 105 

Yugoslavia (JZS) 

Jugoslovenski zavod za Standardizaeiju 
Slobodana Penezica-Kreuna br. 35 
Post. Pregr. 933 
11000 Belgrade 
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Tab'tf 2 f ^c\ locountf) ofongin for some standards 
codes (BSI. I97i.) (Letters are those that prcfic stan- 
dards code, eg BS ) 


ABVT 

Brant 

AISI 

USA 

ANSI 

US\ 

API 

US\ 

AS 

Australia 

ASME 

US\ 

AWS 

USA 

BDS 

Bulgaria 

BDSS 

Bangladesh 

flNS 

Barbados 

ns 

United Kingdom 

BST 

Sweden 

CAN 

Canada 

CAS 

Central Africa 

CEE 

International 

CLI 

lial) 

CEMA 

Canada 

CGA 

Canada 

cosn 

Canada 

CISPR 

Intemalional 

CkS 

South Africa 

CNS 

China 

COOELtCTRA 

Venezuela 

COPANT 

Pan America 

COVENIN 

N enemela 

CRS 

Costa Rica 

CSN 

CzcchosIotaKia 

CUNA 

lial} 

C^S 

C>prus 

DEMKO 

Denmark 

DON 

Mexico 

DGNT 

Bolls a 

DIN 

German) ER 

DS 

Denmark 

PLOT 

Greece 

ES 

Ethiopia 

ES 

Arab Republic of Eg>-pt 

rURONORM 

r uropcan 

Pod 

USA 

GOST 

USSR 

GS 

Ghaiu 

I 

Portugal 

ICAITI 

Central America 

ICONTIC 

Colombia 

lEC 

International 

IEEE 

US\ 

INANTlC 

Peru 
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Table 3.5 {conlimted) 


INDITECNOR 

INEN 

lOS 

IRAM 

IRS 

IS 

I.S. 

ISIRI 

ISO 

1ST 

JIS 

JS 

JSS 

JUS 

KEMA 

KS 

K.S. 

KSS 

LS 

LSS 

MBS 

MI 

MNC 

MS 

MSZ 

NBN 

MBS 

NC 

NEMA 

NEMKO 

NEN 

NF 

NI 

NIS 

NM 

NORVEN 

NP 

NPR 

NS 

NSRDS 

NVS 

NZS 

ONORM 

OS 

OVE 

PN 

PS 

P.S. 


Chile 

Ecuador 

Iraq 

Argentina 

India 

India 

Eire 

Iran 

International 

Iceland 

Japan 

Jamaica 

Jordan 

Yugoslavia 

Netherlands 

South Korea 

Kenya 

Kuwait 

Lebanon 

Libya 

Malawi 

Hungary 

Sweden 

Malaysia 

Hungary 

Belgium 

USA 

Cuba 

USA 

Norway 

Netherlands 

France 

Indonesia 

Nigeria 

Morocco 

Venezuela 

Portugal 

Netherlands 

Norway 

USA 

Norway 

New Zealand 

Austria 

Oman 

Austria 

Poland 

Philippines 

Pakistan 


(continued) 
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T iblt 3 *> icontimuifi 


SVA 

Australia 

s\tts 

South Africa 

s\r 

USA 

s\s 

Saudi Arabia 

SEMkO 

Sweden 

Sl\ 

Swilzcfland 

SIS 

I inland 

SI 

hncl 

SLS 

Sri Lanka 

SN 

Swiizertand 

SS 

Sweden 

SS 

Sudan 

SSS 

Svrn 

ST \S 

Rumania 

T6L 

German Dcmotraiic Republic 

TIS 

Thailand 

TS 

Turkey 

TTS 

Trinidad A. Tobago 

LL 

USA 

lU 

Canad i 

LNI 

Spam 

UML 

llalv 

IM 

Ital) 

LMT 

Uruguay 

tTI 

1 r mcc 

NDt 

Germ in> I R 

MS 

Sweden 

NSSt 

Swuretland 

zs 

Zambia 


network (ISONET) ihai would work towards a common data control method 
for at least the intcrnaitonal standardizing bodies ISO (1977c) lists those 
bodies in ISONCT at that lime 

lEC. the International Electro-icchnical Commission, has a formal agreement 
with ISO to coser on!) electrical and electronic activities, leaving all others to 
ISO 11) 1978 the lEC had published more than 1400 standards documents 
jrC(1978i)isthehstofthese An annual reporUs published (e g IfC, 1978b). 
which lists panels, ofTicials new issues plus drafts in consideration Another 
document (lEC. 1979) presides Ihcoflictal guide to nctisilics 

When seeking the existence of a suitable standard to use it would be of value 
to be able to consult a master global index However, this docs not appear to 
exist, prcsumabl) because of the immense number of standards already issued 
before electronic data base methods were introduced Tlic usual procedure 
would be to consult the most recent annual index volume for selected agencies 
(eg BSI. 1979, ANSI, 1977. SAE, 1977) The BSI publishes a monthl) biblio- 
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graphy of standards received into its library (BSI, 1978). Chumas (1974) may 
be valuable in such a search. Table 3.5 provides a finder of the country of origin 
for given standards letters. 

Attempts have been made in the USA to index internal voluntary engineering 
standards: Slattery (1971, 1972) indexes over 25,000 standards issued by 
hundreds of US agencies; GSA (1978) adds to these by listing those issued by 
US Federal Departments for their purchasing requirements; and Chumas 
(1975) lists standardization activities of 580 internal organizations. The value 
of contact with a standards librarian cannot be overstressed as an efficient means 
to make a standards search. lEC and ISO are not the only international stan- 
dardizing bodies. Because measurement technique applies to all endeavour it 
may be necessary to investigate the standards issued by other bodies. ISO 
(1977c) lists these bodies giving a brief description of their operation and scope. 

Somewhat surprisingly, because all practising technologists make extensive 
use of standard specifications, with the possible exception of within the USSR, 
the subject is not formally taught in its own right. Training is more likely to be 
provided on a rather ad hoc basis with short courses appearing as a new demand 
appears. 


3.6 NATIONAL MEASUREMENT SYSTEMS 

3.6.1 The Concept of a National Measurement System 

A complex system of measuring capability and application exists in techno- 
logically advanced countries. Until recent times this system has not been seen 
as an identifiable entity nor has it evolved with guidance. In the late 1960’s the 
concept of the national measurement system (NMS) came into prominence in the 
USA. It is ‘that system of activity that can be given the credit of enabling manu- 
facture, commerce, trade and communication to develop with some degree of 
compatibility between the different sectors of a nation’s economy and in 
international arrangements’. 

The NMS of the USA has been studied formally at depth (see Huntoon, 1967 ; 
Compton, 1973), data having been published by Sangster (1976a, 1976b) in the 
form of ‘impact matrices’ showing how measurement of a specific kind, such as, 
say, temperature or electricity, relate to specific sectors of the national effort. 

The value of the NMS of a particular country can be seen simplistically by 
the value of measurements made in a year in that country. Measurements are 
very diffuse and extensive in demand : they often tend to go unnoticed. Huntoon 
(1967) suggests that in the USA (those figures are quoted as they appear to be 
f e only set available) 20,000,000,000 measurements are performed each day. 
n 1965, the USA industrial sector invested around 3 % GNP into measurements, 
1 is figure increasing at around 1 % per annum. Data on properties of materials. 
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a small part of the total measurement needs, consumed 5% GNP, also in- 
creasing each year Huntoon stated that those industries that heavily invested m 
measurements usually showed the greatest productivity Many products simply 
could not be made without automatic measurements to feed the control loops 
In that study the US user appeared to be willing to pay 3 % GNP for measure- 
ments Finkelstem {1976) provides some data on the extent of measurements in 
the UK 

Published figures having the same degree of research expended on them do 
not appear to have been compiled for other countries, but several speakers have 
occasionally declared that their own countries ratios are similar to those given 
above 

The USA NMS study arose because of three needs for quantified data relating 
to effort expended on measurements The first was to obtain better under- 
standing of basic measurements and their standards, the second was related to 
data and standards on materials, and the third reason was for information on 
the technological standards and measurements Norden (1975) gives the 
following reasons 

(a) to develop a structural organization for the system under consideration, 

(b) to identify and quantify the importance of the technologies which use the 
NMS, 

(c) to study the second- and third-order effects of the NMS, i e on politics, 
society, economics, environment, etc , 

(d) to identify potential measurement problems which may arise in tech- 
nologies within the NMS 

Having been shown the extent and cost of the NMS and how it impacts on so 
much of a nation’s activity it might eventually be expected that more systematic 
effort will be devoted to measurements and their effective application 

3.6.2 Traceability, Calibration, and Evaluation 

The classic instance that highlighted the ne^ for traceable aud standardized 
measurements comes from American history (Cochrane, 1966) In I904ajunior 
night watchman, in a building of the National Bureau of Standards, Washington 
City, attempted to put out a fire but was unable to fit the hose end to the hydrant 
because the threads were not of the same size Soon after this event a disastrous 
fire at Baltimore again demonstrated the need for standardization when out-of- 
town fire appliances could not use their equipment for the same reason A 
subsequent study showed that at that time there were not less than 600 different 
hose sizes in current use 

Although the concept of traceability of calibrations for measuring instruments 
IS important it seems that comparatively little has been written about it, text- 
books on metrology and instrumentation rarely mention it Stem (1970) is an 
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account of the practical pitfalls about traceability that arise when it is assumed 
that the apparently traceable measuring instrument is regarded as infallible. 
Daneman (1975) is a good general statement; it also includes a useful biblio- 
graphy. Julie (1965) describes traceability to the NBS in the USA and Tagaya 
(1977) covers the situation in Japan. 

Traceability is very much a feature of the accounting aspect of an instrument 
as it involves the certifications of calibration values used; values which are not 
automatically a feature of an instrument but must be assigned by a calibration 
procedure. 

A calibration is said to be traceable if it can validly be traced back along a 
defined line of increasingly more certain calibrations to the primary standard, 
or standards, used in the SI system. This means that any instrument that is 
traceably calibrated and that is at the same level will have the same assigned 
values— within the uncertainty allocated to it. 

The very best instrument has no traceable validity if it cannot be proved at 
any time, especially after a failure, that its measurement values indicated are in 
the official traceable line. If they are then any consequent malfunction or damage 
to the instrument will not upset the validity and use of past values taken with it 
and the instrument can be replaced or readjusted to be the same. Ensuring 
traceability for an instrument is akin to taking out insurance before the disaster 
occurs. Loss of operation of an uncalibrated and non-traceable instrument 
means that data previously taken with it cannot be accepted for there is no 
longer any means to re-establish the instrument calibration after repair or 
replacement to give the same readings. Discrimination and repeatability will be 
regained but accuracy is lost. 

An essential step to obtaining traceable calibration is the existence of a net- 
work of laboratories that can perform the service. This may begin with the 
national physical standards authority, that holding the primary standards, 
performing all of the necessary calibrations but the need soon expands beyond 
that laboratory’s capability. At that stage an organization is established to 
control other capable laboratories. The procedure of authorizing other labora- 
tories having capability in stated areas to make these traceable calibrations is 
generally known as accreditation. As Volume 2 deals with this topic it need not 
be discussed further here other than to put it in perspective. 

The instrument that has been calibrated under traceable conditions is not, 
however, always a firm point of measurement certainty for it was calibrated 
under certain stated conditions where the influence quantities and other 
perturbing parameters were controlled. It must not be forgotten that it may not 
e providing the same calibration values when placed under service conditions. 
The value of a calibrated instrument under different conditions to that of the 
ca ibration has not been given the degree of study that it deserves. Indeed, it is 
common practice to determine the effect on the instrument of influence quanti- 
les at the time of the test but it is rare for the instrument user to explore 
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thoroughly and adequately what influence properties are present at the time of 
subsequent use Once certain forms of noise data are present m the output signal 
they cannot be eliminated by post-detection processing. They must be eliminated 
at the sensing interface stage 

Effective calibration is sometimes better performed on site using mobile 
laboratories (Daneman, 1978, Morton, 1978) or a temporary calibration 
laboratory 

In order to retain controlled uncertainty throughout the traceable chain each 
stage needs to have less uncertainty than that below it there was once a rule m 
vogue to use stages ten or more times better at each level, but this rule has come 
under fire in recent times as being too optimistic Often an external agency will 
develop a piece of apparatus that appears to be as good, if not better than, the 
official standard It cannot be given a traceable calibration to the ultimate of its 
capability, nor can it be used to replace the official standard Legal and experi- 
mental processes will be initiated and eventually the apparatus might be adopted 
as the new standard, but this takes lime The process usually begins when a 
measurement gap develops between users and the standardizing groups and 
then, as industry and science use their own resources to solve the problem, the 
situation changes to that of a measurement pinch (see Cochrane, 1966) 
Standards laboratories are unable to keep far enough ahead at all times for all 
required variables due to their limited resources and the time constants of the 
development processes involved 

Calibration proves the validity of an instrument to provide an accurate 
indication of the variable u is made to quantify Such factors as the ability of the 
instrument to perform for long periods of operation, to withstand reasonable 
shock loads, temperature excursions, relative humidity changes, and more 
factors are not strictly matters of calibration In recent decades there has arisen 
the concept of evaluating instruments, along with their manuals, servicing 
features, and any parameters of importance 

It IS a costly matter to evaluate an instrument but the cost can be very worth- 
while when compared with the costs that use of the instrument carries with it 
Figure 3 3 shows the bulk of the cost sources that are associated with an instru- 
ment during Its life cycle Evaluation will help to minimize many of the areas of 
cost shown The place of evaluation is in situations where the instrument plays 
a vital role or where instruments are to be used in situations where eventual 
contractual disputes may arise An evaluated instrument design is more likely 
to bear post-event scrutiny than one not tested in this way 

It IS an unfortunate fact that the general quality of much of manufactured 
instrumentation is not up to the expectations of the customer at the time of 
delnery Moss (1978) has studied the quality of contracted instrumentation 
delivered to a major USA agency In the study period around 25% of delivered 
instrumentation was rejected at the acceptance test stage and one out of ten 
items needed attention to obtain proper operation This is for contracted 
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Figure 3.3 Chart showing sources of cost arising through use of an instrument 


products which would be expected to be better than ‘off-the-shelf lines’ sold on 
the open market. Evaluation is part of the quality control of instrumentation 
and it is to be expected that more will be conducted as time passes. Evaluation is 
described in greater detail in Volume 2. 

3.7 RELEVANT INSTITUTIONS AND ACTIVITIES 

To complete this general introduction to the practice and administration of 
measurements it is of value to consider the kinds of institutions and organized 
activities that relate closely to some factor of measurement as a distinct entity. 
It is not possible to list all activities here since they number too many. 

Mention has already been made of the group of national- and international- 
level laboratories that maintain the primary standards and of their sister 
organizations that organize and issue standards specifications. 

A third group that is steadily growing in size is that formed by those national 
organizations that operate accreditation schemes. The first in existence at 
national level, was the National Association of Testing Authorities (NATA) 
in Australia— for more about accreditation see Volume 2. 
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An additional arrangement exists in the USA in which standards laboratories, 
at all levels, can voluntarily become members of the National Conference of 
Standards Laboratories (NCSL) This organization has over 350 member 
laboratories, a few being in other countries It conducts its business on a volun- 
tary basis, the NBS supplying the secretariat, and arranges a regular annual 
conference Work is mainly conducted through numerous subject and regional 
committees Subjects attended to include education and training, measurement 
requirements, national measurement requirements, laboratory evaluation, 
biomedical safety, calibration systems management, measurement assurance, 
product design and specifications, calibration laboratory automation, and 
recommended practices A considerable amount of research and investigation is 
undertaken, much of which is published in the monthly NCSL Newsletter 
At least 40 countries have professional institutions that cater for the needs of 
persons engaged m measurement and instrumentation pursuits These may be 
primarily concerned with or have a major committee operating in this area 
Examples are the Institute of Measurement and Control (IMC) m the United 
Kingdom and the Instrument Society of America (ISA) Each body has its 
particular features and generally caters for varying groups of measurement 
interest ranging from scientific through industrial to sales oriented groups 
In 1965 the International Measurement Confederation (IMEKO) was 
formed This, a non-profit, non governmental organization operates through 
a secretariat situated in Budapest, Hungary Over 25 countries have member- 
ship through a suitable national professional body that has no official ties 
with the Government For example in the German Democratic Republic 
membership is held by the Gesellschaft fur Mess und Automatisxerungstechmk 
and in China by the Chinese Scientific Society for Measurements and Instru- 
ments Over the years since 1965 membership has spread from the original 
countries (Hungary, Poland, and USSR) to include firstly many northern- 
hemisphere nations and then in the 1970’s developing countries m the India- 
Asia region and those of Australasia However, the African and South American 
continents are poorly represented in the list of members 
IMEKO conducts its business by correspondence, through annual meetings 
arranged by the various working technical committees, and through regular 
international congresses IMEKO is one of the five sister members of the 
FIACC group, two others being the International Federation of Automatic 
Control (IFAC) and the International Federation of Information Processing 
(IFIP) 

Each of the above institutional arrangements is based on, and controlled by, 
a widespread membership forming some kmd of basically capital absent 
enterpnse They can provide interaction between people but are not easily able 
to conduct research and development undertakmgs 
Additional to the government institutions already mentioned, that possess 
established laboratories for advancing the science and practice of measurement, 
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there also exist several other unique laboratories. In India there is a major 
facility, the Central Scientific Instrument Organization (CSIO), working for the 
development and transfer into application via industrial marketing, of scientific 
and industrial instrumentation. CSIO, a branch of the national government 
scientific service, has around 1000 staff who are committed to producing instru- 
ment designs that are vital to the development of the country. Over one hundred 
designs have been developed to the well-engineered prototype stage ready for a 
manufacturer to take up for manufacture and marketing. CSIO also operates 
several regional instrument repair centres throughout India as well as training 
schools and international aid programmes. 

At CSIO emphasis is on the development of more conventional, already- 
existing instruments. In the UK the Sira Institute formerly the British Scientific 
Instrument Research Association, set up through partial funding from govern- 
ment plus subscriptions from subscriber trade organizations, has a brief to 
advance instrumentation in all of its phases. Efforts over the past decades 
include design and research on new instruments, services to the general public, 
evaluation of instruments, provision of training, advice to requests, and supply 
of testing and calibration services. 

The work of a very small number of private consultants, who have become 
expert in such matters as international laboratory design, instrument design, 
marketing, survey work, and training, must also be recorded. These persons have 
accumulated experience that places them in a unique position in matters of 
measurement and instrumentation. 

This brief account would not be complete without mention of the value of 
trade fairs and exhibitions organized for the trade to display its wares. Instru- 
mentation does occasionally have events specifically devoted to it but more 
generally it would find a display outlet in either control, optics or fine-mechanism 
oriented events, or within exhibitions concerned with applications that make 
use of measurement. Many such events combine a conference with their 
exhibition. 
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E.-G. WOSCHNI 


Signals and Systems in the Time and 
Frequency Domains 


Editorial introduction 

Measurement is the procedure by which information is extracted about parameters of a 
system, mapping the basic information entity into a meaningful knowledge statement. The 
measurement information is conveyed from the system to the point of observation in the 
form of signals, A considerable body of knowledge now exists for processing signals: this 
chapter provides a condensed summary of that knowledge. It embodies elements taken 
from many subject groups— namely, signal theory, information theory, communication 
theory, transfer function theory, and more. The boundary of these topics is such that they 
overlap each other to a great extent. 

The Chapter is intended to promote awareness of the many mathematical techniques 
that can be brought to bear at one or many stages of a measuring instrument data handling 
chain. It is provided to act very much as a point of mathematical reference for the rest of 
the handbook. As such the reader will find many of the topics are used in subsequent 
chapters that often extend this material into the special area of interest to the subject in 
discussion. 

The symbols and terms of mathematics are generally international and thus are, to a 
large degree, uniformly used. Different usage does, however, occur from writer to writer. 
In the light of there being no single absolute nomenclature standard to work to, it was 
decided that authors use that with which they are familiar provided adequate definition 
is provided in their contribution. In general, each chapter stands alone as a contribution 
containing sufficient preamble to introduce the topic. 


4.1 INTRODUCTION 

As represented in Figure 4.1, the measurement-technological task is to establish 
the characteristics of actually occurring input signals x,. from known (measured) 
output signals y^. Signals having several components, for example, can be 
summarized to form signal vectors x,y, z, in the same way as can the disturbances 
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Figure 4 I Task of measurement 


z. The output quantities are a function of the input quantities, and the relation- 
ship between them is given by the behaviour of the measuring system The aim 
IS to have a certain behaviour, which is represented by an ideal mathematical 
operation 0,d 

= 0^{x} (4 la) 

In the individual case, this operation may be a constant with one dimension, to 
give an example, but it may also be a differentiation or integration, as for instance 
in the case of measuring devices for averaging 
The practical measuring system with the connection 0,„i between its output 
and input quantities also includes real, falsified output quantities which depend 
on the disturbances z 

= 0,„,(x, y) (4 lb) 

Hence, an error t occurs gi\en by 

c = y,d - >.,.i (■* ic) 

In this chapter, these relationships will be dealt with in detail, m particular those 
with time-varying signals The present discussion has the following four 
objectives 

(a) description of the signals by characteristic values and functions, 

(b) description of measuring systems by means of characteristic values and 
functions, 

(c) description of the errors and deduction of quality criteria, 

(a'f means ibr optimizing ifie system, tfiai is, ibr minimizing errors 

Objective (a) is discussed in Section 4 2, the other objectives, which are based 
upon the first, are dealt with in Sections 4 3 and 4 4 Reference is made in each 
case to descriptions in the time and frequency domains Figure 4 2 shows the 
objectives which have been outlined above and which arc tailored especially to 
the following two fundamental problems of measurement technology 

Signal identification In Figure 4 2a the aim is to determine the signal parameters 
of the input signals m terms of known (measured) output signals y, where, if 
possible, disturbance variables should have no influence whatsoever 
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Figure 4.2 (a) Identification of signals; (b) identification of systems 


System identification. Figure 4.2b, on the other hand, depicts measurement of 
the parameters of a system. In that case input test signals x and associated output 
signals y are both fed to the measuring system. The parameters which are 
characteristic for the system have to be determined from the output quantities 
of this measuring system y^,. 


4.2 SIGNALS 


4.2.1 Classification of Signals 

In addition to the useful signal x, interfering signals z occur. Both types of signal 
are carriers of information: wanted information in the case of the useful signal 
and unwanted in the case of the interfering signal. Both signals can be treated 
with the same methods. 

For signals to be carriers of information, is necessary that certain parameters 
a, of these signals be allowed to change: 

in a signal x(n,.) it must be possible, for the purpose of information 
transmission, to change these information parameters. Information 
parameters are those parameters of the signal upon which the 
behaviour of the information to be transmitted is mapped. (4.2) 

Table 4.1 gives a survey of signal designations: A distinction is made between 
analog signals, which arc signals having no quantization of the information 
parameter, and discrete signals in which, because of quantization, the informa- 
tion can assume a finite number of values. An important special case is that 
involving binary signals in which the information parameter can take only two 
discrete values: 0 and 1. 






Table 4 I Classification of signals 
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I'icurc 4.3 (a) Branching; .tj = .Vj = .v,; (b) adding or 
Mihtraction; .Vj = .v, + .Vj — .Vj: (c) multiplication; X 3 = 
.V, • .\s: (d) division; .v, = ■'f i/v. 


In timo-dcpcnclcnt signals--most signals arc lime dependent or arc converted 
to become time-dependent signals by scanning, as is the ease in television — the 
information parameter can cither change at any time (continuous signals), or 
clitmgcs arc possible at given cycle times only, due to time quantization. 

If the entire behaviour of the signal, including future behaviour, is known, the 
signal is .said to be a determiiu'd one. The transmission or measurement of this 
vignal. naturally, docs not produce any gain of information. This type of signal 
plays a major role as test signals (c.g, impulse function, step function). Contrary 
to this, a signal to be measured has little a priori information. Signals with an 
unknown flow are called non-del ermined signals. If they arc described by a 
prob;diilily distribution, they may also be called stoclmsiic signals. 

To trace and represent the flow of the signal, signal flow graphs arc used. 
I'lgure 4.3 gives a survey of the graphical representation of the branching, 
addition, subtraction, multiplication or division of signals. Signals may be 
represented in the time and frequency (spectral) domains. For the formation of 
st;itisiical characteristic values, time or statistical means arc used. Furthermore, 
signal representations arc applied by making use of geometrical relationships. 

4.2.2 Signals in the Frequency Domain 

4.2.2. 1 Perithiic signals'. Fourier spccirum 

\ \ery important and determined fundamental signal, which is al.so of major 
importance ns a lest signal, is the harmonic oscillation 

.V -- JC' sinfojf -f (;i) 


(4.3a) 
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Figure 4 4 Harmomc oscillation (a) indicator representation (b) time representation 


where ^ is the amplitude, a(=2nf) is the angular frequency. T(= \jf) is the 
oscillation penod and <p is the phase angle (often zero) Representation in the 
complex plane, as shown in Figure 44, yields, according to the so-called 
'symbolic method’, complex and oriented indicators x and, consequently, the 
relationship 

X ^ exp[j(cjt + (p)] - ^ cos(ur + <fi) + sinfcui + cp) = X e^®' (4 3b) 
which IS equivalent to equation (4 3a) and which has the onented complex 
amplitude 

(4 3c) 

According to Figure 4 4, the directed quantity can be split into real and 
imaginary parts 

S = A +IB\X\ = X = ^(,A’ + ly = arctaii(B//)) (4 3d) 

The advantage of the symbolic method consists, above all, m the possibility of 
having a simple and easily understandable addition of sev eral partial oscillations 
having the same frequency (Woschni, 1981) Periodic signals are particularly 
useful as test signals, because the same signal behaviour is obtained after each 
cjcle penod T and can be observed on an osalloscope synchronized with T 
According to Founer, it is po^ible to represent this type of signal having a 
cyclic time behaviour x(0 by a senes of sinusoidal and cosinusoidal oscillations 
with frequencies which are multiples of the fundamental frequency coq gi'eo by 
tJo = 2n:/o = 27t/T (44a) 

as shown in Figure 4 5, which depicts two sinusoids (of many) that form the 
cyclic rectangular oscillation 

x(0 = i/lo + S [>4,cos(nt«JoO + B,sin(«a»o0] 

m t 


(44b) 
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Figure 4.5 Representation of the rectangular oscillation by sinusoids 


with the amplitude spectrum 

Q = J{Al + B^) (4.4c) 

The Fourier cocfiicients can be calculated from the relationships 

2 

A„ = -\ x(l)cos(ncuo0dt (4.5a) 

‘ J-r /2 

2 

B„ = — x(t)s\n(>iWot)dl (4.5b) 

1 J-T/2 

From equations (4.5a, b), one can see that there are only cosine terms for even 
time functions x(t) = x( — t), and only sine terms for odd functions x(f) = 
-.\(-i). 

Transformation with the aid of Euler’s theorem, leads to the complex Fourier 
scries 


A'(0 = ^ Z -^(j»Wo)e^''“"' (4.6a) 

* n= — oo 

with the complex coefficient 

r+r/2 

^0"Wo)= A-(/)c dt (4.6b) 

J~T/2 

and the amplitude spectrum (C„i( given by 


IQJ = 


A'fjnwo) 


(4.6c) 


In addition to the system of orthogonal functions based upon sine and cosine 
functions other orthogonal systems have recently been introduced, in particular 
the Walsh functions (Harmulh. 1970) which arc shown up to eighth order in 
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Figure 4 6 Walsh functions up to eighth order 

Figure 46 From equations (44) and (4 5), respectuely, the corresponding 
relationships are developed 

x(;) = yv^+ Y.LW'^.caU;) + (47a) 

» 1 

with the Walsh coefficients and 


1 

r+T/j 

x(t)cal„(t)dt 

(4 7b) 

J J 

'-r /2 


1 

e+r/2 


= J 

x(0 sal„(r) dt 

'-r /2 

(4 7c) 


As one can see, the Walsh spectra, which are also called sequency spectra, are 
superior to the Fourier spectra, in that the multiplication with sine and cosine 
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Figure 4.7 Experimental determination of the Fourier and Walsh coefficients 


functions, respectively, is obviated by a simple reversal of signs. Therefore, these 
spectra can be determined more easily by experiment than the Fourier spectra 
(Figure 4.7). 


4.2.2.2 Non-periodic signals, spectral amplitude density, Fourier transform 

Non-periodic functions are of a great importance both as determined signals, 
i.c. test signals (step function, impulse function) and as non-determined signals 
(unknown signals which are to be measured). A discrete Fourier spectrum exists 
for periodic signals, whereas a continuous spectrum develops for non-periodic 
signals, it is obtained from the Fourier series (equation (4.6)) by passing to the 
limit; 

T CO ojq = 2nJT -♦ dco \jT drojln naio -* m (4.8a) 
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In this case 


= J Jc(t)e '"'dt = F{x(l)} 

(4 8b) 

x(0 = ^J = 

(4 8c) 

Here the integrals are to be understood as the Cauchy principal value 



F and F * are the Fourier and inverse Fourier transforms respectively 
Physically, represents the complex amplitude related to dw, and is 

therefore also called spectral amplitudedenstty having thedimensionofamplitude 
per frequency inter\al, 1 e V s and V/Hz, respectively Depending on whether 
the frequency scale [Hz] or the angular frequency scale [s '] is chosen the 
values will differ by a factor 2n 

An identical calculation can be made for Walsh functions This, however, 
leads to the sequential amplitude density (Harmuth, 1 970) 

The Fourier transform has the following important properties and theorems 
The transform is linear, i e 


(49a) 

(where o- is the sign for ‘assignment’) Fora change of the time scale, the relation 
ship IS 


(4 9b) 

Particular importance should also be aitnbuted to the displacement theorems, 
namely the time displacement 

- to)o- ^0<u)exp(-j(ufo) 

(4 9c) 

ana' ffte frequency shift 


X(l)cip0wlo)0 - tOo)] 

(49d) 

For differentiation, one obtains 


o- 

(49e) 

and for convolution 


[ — t)dTO- 

(4 90 
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4.2.2.3 Spectral power density 

For non-deterministic signals, let us assume in the following that they are 
stationary, that is, that their time averages are not time-dependent quantities. 
To identify such signals x(t), the spectral power density is used. It is defined 
as the part of the power AP, which falls into a differentially small frequency range 
Aco, that is. 


„ , , AP dP 

5^:cx(w) = hm — = — 
Aco dco 


(4.10a) 


In contrast to the spectral density, which cannot be determined in the case of 
random signals, the spectral power density is a real-valued function of the fre- 
quency CO. It does not contain any phase information. The latter is lost in the 
calculation of the average value, which is necessary for the formation of the 
power. This can also be seen from the relationship existing on the basis of 
Parseval’s equation, by averaging over a time domain T (Zadeh and Desoer, 
1963; Woschni, 1973): 


„ . ^ 1 |XGu>)|^ 

= r- hm 
r-® 


27 


(4.10b) 


Consequently, the spectral power density is real and always positive, and it is 
an even function for which = S^^(—co). Since the phase angle is missing, 

S^^{co) does not contain the full information about x(t); a reverse calculation is 
not possible. The power P of the entire signal existing in the whole frequency 
domain, can be calculated on the basis of Parseval’s equation, for the energy W 


W = 


x^(r) dt 


=-r 

271 




(Zadeh and Desoer, 1963; Woschni, 1981): 


P = x^(t) = lim — r x\t) df = f lim — — ^ dco = f dco 
r-.„27rj_r 27c 27 J_^ 

(4.10c) 

From equations (4.10a,b) it can be concluded that a random phenomenon 
contains a periodic component of frequency coq with amplitude X. Dirac delta 
functions will develop in S^^{co) at the frequencies ± coq (Woschni and Krauss, 
1976): 

S:t..(w) Lo = CO I - COp) (4. 1 Od) 

Furthermore, from equation (4.10c) it follows that the power density S^^(co) 
must decrease rapidly from a certain critical frequency and must vanish at higher 
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frequencies because of the requirement of boundedness of the power P. Depend- 
ing on the critical frequency , a distinction is made between narrow-band and 
wide band signals 

4Z2.4 Practical miestigations 

The foundation for investigations in practice is the possibility of determining 
the charactenstic functions and values by experiment For the registration of 
spectral amplitude density and power density, respectively, one applies the same 
principles In the case of filtering, use is made of several filters of bandwidth A(o 
which are staggered in the frequency and whose outputs will be connected one 
after the other to a display unit for the voltage which is proportional to |^0<y)l 
According to Figure 4 8a, it is likewise possible to synchroni 2 e the switch with 



Figure 4 8 Spectral analyser (a) switched filter pnnciple of operation, (b) vanable 
centre frequency of single filter method, (c) formation of the power density 
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a sweep voltage that deflects the beam of an oscilloscope in the x-direction, 
proportionally to co; consequently, the spectrum |^(j<w)l = /(") can be re- 
corded. 

Figure 4.8b shows another method which is characterized by the fact that 
only one filter of bandwidth Aco and of centre frequency coip (which is also called 
intermediate frequency is required. Tuning is carried out by mixing with the 
continuously tunable auxiliary frequency o)^, where the following frequencies 
will be allowed to pass: 

CO = (W,F - (Oj^; o)* = fy,F + (4.11a,b) 

The low-pass filter at the input eliminates the image frequency co*, so that one 
can cover the entire frequency domain required by tuning coy^ . Synchronizing the 
sweep signal with the x-axis sweep will generate the amplitude spectrum on the 
oscilloscope display. It is important to ensure that the filter has enough time to 
respond to the starting surge. The transient time 1,^ of the filter needed is, 
according to the Shannon sampling theorem (Section 4.3.7): 

C„ = 1/2A/ = tt/Aco (4.11c) 

The sampling of the spectrum must, therefore, be carried out relatively slowly 
(low sweep frequency of the oscilloscope), for which reason long-persistence 
cathode ray oscilloscope methods are required. During the scanning run, the 
spectrum must be practically stable; it must be a steady signal in the statistical 
sense. 

In addition to filtering and variable acoustic frequency methods, other 
techniques are used which make use of computers to implement the Fourier 
transform according to equation (4.8b) directly. This will be discussed in detail 
in Section 4.2.4, where specially adapted techniques for the fast Fourier trans- 
form will be dealt with. The computing advantages which arise when the 
amplitude sequential spectrum defined by the Walsh functions is used instead 
of the amplitude frequency spectrum, have already been pointed out in connec- 
tion with Figure 4.7. While the techniques described in Figure 4.8 supply the 
amplitude response of the spectrum |X(ja))( only, it is possible to obtain 
additional phase information from the relationship cp = arg[.£(jco)] using 
computing methods. 

To display the power spectrum the formulation AP/Aco = Ax^(t)/Aa) is 
implemented, as outlined in Figure 4.8c, between the filter output and the input 
of the display unit and the oscilloscope. 

To give some examples, consider some more signals which are used as test 
signals as well as for the approximate representation of measurement signals 
(refer to Section 4.2.7). 

For a periodic sequence of rectangular pulses having the pulse width: 
repetition ratio x = AT/T{ses Figure 4.9a), after substitution into equations 
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(b) 



Figure 4 9 (a) Sequence of pulses (b) spectrum for t = 0 5 


(4 5a, b) and (4 4c) or (4 6b, c) and after an elementary calculation, one obtains 
pure cosinusoidal oscillations having the amplitudes 

= = {412a) 

miT 

Figure 4 9b shows the spectrum as well as the envelope curve for t = 0 5 The 
relationship with Figure 4 5 becomes immediately evident the smaller the 
pulsewidih.ie r„ = AT The pulseheightisstillcorrectly indicated, whereasthc 
region up to the first zero of the envelope curve For t ^ 0, a constant spectrum 
results because the first zero shifts towards to = co This case is significant as a 
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test signal. For t -» 0, the unit impulse becomes the Dirac function (5(f) with the 
normalization 



According to equation (4.6b), this has a spectral amplitude density XOw) = 1, 
with only cosine oscillations of constant amplitude occurring. 

Another important test signal is the unit step w(f), which will be explained in 
detail in Section 4.2.7. For this, because 

fO f < 0 
^ “ |l f > 0 

(from equation (4.6b) for the spectra] power density), .^(jm) = l/jo) showing 
that only sinusoidal oscillations occur with an amplitude which declines 
hyperbolically with 1/co. 

The power spectrum declines with lX(jco)\^ according to equation (4.10b). 
This means that, in the case of a periodic rectangular oscillation, the major part 
of the power lies within the frequencies up to the first zero, and that Sxx((o) is 
also constant for the unit impulse. The spectral power density for the unit step 
decreases very rapidly (as l/co^) with the rising frequency. Another very impor- 
tant signal is resistance noise (Johnson noise), the power spectrum for which is 
constant up to very high frequencies (more than 10^^ Hz) and assumes the value 
of 5((u) = kTRIn {k = 1.37 x 10"^^ W s K"*), T = absolute temperature (in 
K) and R = resistance (in D). Consequently, the noise voltage for a frequency 
bandwidth of Acu is: 

/4 

= l^kTRAco] (4.12b) 

For more details and further typical signals, see Section 4.2.7. 

4.2.3 Signals in the Time Domain 
4.2.3. 1 Mean values 

A time function .v(/) can be characterized by time averages of the nth order, 
which are also called moments of the nth order: 

^ = ^ J ^xXOdf (4.13a) 


or, for non-periodic signals: 
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For « = 1, one obtains the linear (arithmetic) average which, from the physical 
point of view, can be interpreted to be the zero-frequency component of the 
signal or, according to Section 4 2 21, the Fourier coefficient AJl 
Of particular importance is the average value for n = 2, which, as a mean 
square value according to equation (4 10c), represents a measure for the power 
The root from the mean square value is the effective value given by 

= (4!3c) 

The following relationship exists between the component x(r), the component 
x(t), and the mean square value 

^ + ?(0 (4 13d) 

For a harmonic oscillation, one obtains = ^/>/2 for the effective value 

4 2 3 2 Correlation function 

The correlation function ^(t) represents a generalized mean square value, where 
a function is multiplied by the function displaced by time t, and where the mean 
value is formed If this function is the same function x, we call it an autocorrelo' 
tion function where 

= Irm f x((>x(l + t) d( = x(0x(( + i) (4 14a) 
r-.® J-T 

It IS suitable for making statistical statements on the internal relationships 
between function sections, as is now shown m a survey of its typical properties 

(a) In averaging, the phase information is lost, as is the case for the spectral 
power density (Section 4222) Therefore, there are also direct relation- 
ships between the autocorrelation function and the spectral power 
density5„(a;), which will bcdiscussed in detail in Section 4 2 3 3 However, 
periodic components in the signal x{t) will be maintained without giving 
consideration to the phase position, because the following expression 
applies to the autocorrelation function of the harmonic oscillation, 
independently of the phase position 

= 2 -^^ cos((ur) = X^fi cos(wt) {4 14b) 

(b) The value for t = 0 represents, according to (4 14a), the mean square value 
and IS the maximum value of the autocorrelation function 

= (414c) 

The other threshold value for t -* oo is the square of the linear mean value 
lim (4 14d) 
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(c) Since it is of no significance whether the function x(t) in equation (4.14a) 
is displaced towards positive or negative times, the autocorrelation function 
is an even function; 

- t) = x(t)x(t + t) = x(t)x(t - t) (4. 14e) 

If two different signals x(t), ^(t) are being compared one with the other, the 
measure used for the statistical relationship between them is the cross-correlation 
function according to the definition: 

1 

= lim — x(t)y(t + T)dt = x(t)y(t + t) (4.15a) 

T-*co J ~T 

In measurement technology, the cross-correlation function plays a major role 
in solving system identification tasks during normal operation by means of the 
disturbances (see Section 4.3.8). The solution of this measurement problem is 
the foundation for adaptive systems (Davies, 1970). 

In contrast to the autocorrelation function, the cross-correlation function 
has the following features : 

(a) It has no even functions, but the following relationship holds; 

^xyij) = (4.15b) 

(b) It contains relative phase information concerning the two events x(£), y(f). 
In particular, the cross-correlation function of two harmonic signals with 
the same frequency disappears if the phase shift is + njl, as can be seen 
following substitution into equation (4. 1 5a). Likewise, the cross-correlation 
'of two harmonic oscillations is zero if the frequencies are unequal. 

(c) The limiting cases are; 


’/'*#) = 'I’yxi^) = (4.15c) 

lim = 3c(tj • ^ (4. 1 5d) 

X-*KO 

The experimental registration of the correlation function will be discussed in 
Section 4.2.3.4. 


4.2.3.3 Relations to spectral power density: Wiener-Chinchine theorem 

The autocorrelation function, like the spectral power density, contains no phase 
information; this is lost in both cases because of the averaging operation. There 
is a relationship between both functions, as is the case between the time behaviour 
of the signal x(t) and the corresponding spectral amplitude density XQo}), via 
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the Fourier transform, this relationship is known as the Wiener-Chinchine 
theorem (Woschni, 1981, Davies 1970) 


^ '"'dt = i 

(4 16a) 

1 ”s„(to)e<"'dm = 2;rF->{S„(a))) 

(4 1 6b) 

Since the autocorrelation function is an even function (Section 423 2), co- 
sinusoidal oscillations only occur Consequently, equations (4 16a,b) can be 
rewritten 

1 r* 

S„(ft))s=- 1 t^„(T)COS(GJT)dT 
n Jo 

{416c) 

“ 2 1 S,, co$(wt) dco 

Jo 

(4 16d) 

If the autocorrelation function in equations (4 16a, b) is substituted by the cross 
correlation function then the corresponding relationships with the cross- 

power density will be obtained 


(417a) 

= J S„0<u)s’“'diu = 2 jF' '{S„0<u)) 

(4 17b) 


4 2 34 Practical iniestigations 

The characteristic functions and values which have been introduced can be 

found by experiment 

For the measurement of the linear mean value x(0. use is made of either 
moving-coil instruments or transistor voltmeters having a series-connected 
integration link, compare Figure 4 lOa, with Figure 4 10b showing a simple 
analog circuit for finding the average value according to the equation 

using the assumption Ug le 7? > 1/toC 
The mean square value can be found by means of the principle shown in 
Figure 4 1 1 To obtain the square of a value use is made of either electronic 
circuits with a square-law characteristic, c g diodes or transistors (as m the case 
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(b) 



Figure 4. 1 0 Measurement of the linear mean value : (a) basic circuit ; 
(b) simple realization 


of transistor voltmeters), or a measuring device having a square-law charac- 
teristic, such as a soft-iron or hot-wire instrument (heating k P k, PR). 

Figure 4.12 shows the basic system for determining the autocorrelation func- 
tion of the cross-correlation function. Delay due to a delay section, multipli- 
cation, and averaging yield, according to equation (4.14a), the autocorrelation 
function (switch in position A) or, according to equation (4.15a), the cross- 
correlation function (switch in position B). Since the integration time T cannot 
be chosen to be infinite, only the short-term correlation function 

Wt)- — (4.19) 

will be measured in practice; under certain circumstances it reflects the actual 
behaviour of ij/(x) with sufficient accuracy (Section 4.2.4). In addition to the 
analog methods discussed, used to ascertain experimentally the characteristic 



Figure 4.1 1 Measurement of the mean square value 


Delay system 



B 


Figure 4.12 Experimental determination of the correlation function : (a) auto- 
correlation function (b) cross-correlation function tl'xy('r) 
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functions and values m the time domain, an increasing use of digital methods is 
evident In this case, the function is split up into various values (support 
values) at different times t, the sampling theorem (equation 4 1 Ic)) having to be 
observed With the aid of the corresponding relationships, it is possible, for 
instance, to calculate the correlation function point by point from the support 
values 

Further methods are based on the application of the Fourier transform, i e 
calculation of the correlation function from the power spectrum which has 
been found by experiment (see Section 4 24) 

Consider now some typical cases which are to stand as examples (see also 
Section 4 2 7) For the autocorrelation function of a rectangular pulse according 
to Figure 4 13a, 

The result shown in Figure 4 13b reveals that if the pulse width 27'i decreases in 
the limiting case -► 0, that is for the unit impulse 5(t), the autocorrelation 
function is also a delta function This also follows immediately from the calcu- 
lation of the autocorrelation function of white noise having a constant power 
spectrum S„(q)) = constant, as is the case for unit impulse For this calculation 
equations (4 16b) and (4 16d) are used 
The relationship between the width of the power spectrum and the corre- 
sponding autocorrelation function can also be seen if the autocorrelation 
function for narrow-band noise is calculated in accordance with Figure 4 14a 

= a [ e“*^ do) = 2a<Og — — * — 

J-®, COgt 

As shown in Figure 4 14b, the autocorrelation function becomes smaller as the 
noise bandwidth increases, degenerating into a delta function for white noise 



Figure 4 13 (a) Rectangular pulse and (b) corresponding autocorrelation function 
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1q) 




Figure 4. 14 (a) Narrow-band noise and (b) corresponding auto- 
correlation function 


This shows clearly that statistical relationships cease to exist if two noise signals 
are slightly displaced relative to each other. 

4.2.4 Relevance of Time and Frequency Ranges and Transforms Between them 

In the following, a summary will be given of the relationships established so far 
between the various signal representations in both the time domain and the 
frequency domain. Also consideration will be given to the possibilities of con- 
version indicated above. Table 4.2 contains a survey of such relationships. 

While the time function x(t) and the spectral amplitude density contain 
the full information concerning the signal, this is not the case for the functions 
resulting from averaging (autocorrelation function and power density). Phase 
information is lost due to averaging. For this reason, conversions are possible 
only in the direction indicated by the arrow. 

Conversions are possible, via the Fourier transform, between functions in the 
time domain and corresponding functions in the frequency domain. Therefore, 



Iwcen time and frequency domain signals fWoschni, 1973 
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it does not matter, as far as the significance of the statement is concerned, which 
of these two functions is measured, rather it is a question of convenience. For 
instance, it is useful to carry out measurements in optical communications 
predominantly in the time domain. In vibration measuring technology, however, 
it is preferable to carry out the measurements in the frequency domain. 

To accomplish the Fourier transform, digital computers are used at the 
present time. The basis for this is the fast Fourier transform (FFT). Using a 
transform specially tailored to fit the treatment in the digital computer, discrete 
Fourier transform programs have been established which save computing time, 
and where at least 1000 support points are quite usual. The calculation supplies 
the Fourier coefficients. As the number of support points is limited, the short- 
term correlation function is determined, which, however, is practically identical 
with the correlation function, provided the correlation time ^ T (refer to 
equation (4.19)). For more detail see Birgham (1974). 

The importance of the conversions has already been discussed. Mention 
should again be made of the fact that x(t) and X(ju>) are related in the same way 
as are and S^^(co). For instance, a constant amplitude density has the 
delta function, which is a time function, as an autocorrelation function, just as 
the constant power density in white noise has. 


4,2.5 Characteristics of Sigpnals : Using Probability Functions 


4.2.5.1 Probability function and probability density 

To describe randomly fluctuating events <^(0, use is made of characteristic 
functions which are based on the theory of probabilities. 

The probability distribution IF(x), which is also called the first-order distri- 
bution function, indicates the probability p that the signal <^(t) remains smaller 
than a barrier x, i.e. 


Wix) = pK(0 < X] 

(4.20a) 

The limiting values of the probability distribution 


lira IF(x) = 0 

— CO 

(4.20b) 

lim IF(x) = 1 

(4.20c) 


+ CO 


are also clearly understandable, because they mean the impossibility of a value 
smaller than — oo as well as the certainty of the occurrence of any signal value 
^{t). For continuous functions ^(t), the probability distribution W(x) is a 
monotonically increasing function. 
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The probability density \\(x) is the probability related to Ax for the fact that 
the values of the event c(t) are within a narroiw region near the value x, that is 

u(x) = ^ ^ <^ + Ax -» dx (4 2Ia) 

When comparing the Figures 4 I5a, and 4 I5b it can be seen that 

j B(u)dli= H'(a:) (4 21b) 

and 

dVV(x)/dx = h(x) (4 2Ic) 




Figure4 15 (a) Probability dislnbution and (b) corres- 
ponding protebility density 
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Taking into consideration the limit (equation (4.20c)) the normalization is 



(4.21d) 


As shown in Figure 4.15, the probability that ^{t) lies within the interval X 2 to 
X] is calculated by: 

p[xi ^ ^ < X 2 ] = W(x 2 ) — kF(xi) = f vv(x) dx (4.21e) 


For multidimensional distributions Xj, X 2 , • - - , x„, it is possible to introduce 
compound probability distributions lF(xj, X 2 , ■ - ■ , x„) and compound prob- 
ability distribution densities w(xi, Xj, . - . , x„): 


W(xi, X2, . . . , x„) = pKi(t) < X,, (^ 2(0 < X2, . . . , ^„(t) < x„] (4.22a) 

d” 


w(Xi,X2,...,X„) = 


dxidx2,...,dx„ 


W(xi,X2,-..,x„) (4.22b) 


Furthermore, conditional probability distributions lF(xi/x 2 ) and conditional 
probability distribution densities w(xi/x 2 ) are defined. They indicate the prob- 
ability that the value Xj occurs on condition that the value X 2 already exists. The 
following relationships hold for the compound probability density: 

w(x, y) = w(xly) - w(y) = w(y|x)w(x) (4.22c) 

Of utmost importance in practice is the Gaussian distribution density: 

where a = x(f) is the linear mean value and cr the standard deviation, related to 
the square mean value, x^(r), by 

= n/C^ - (4.23b) 

Figure 4.16 shows the Gaussian distribution density for a = 0. 


42.5.2 Relations to the mean values: ergodic theorem 

The expectation value £ of a function / (x) is defined as follows : 

E{f(x)} = f /(x)w(x) dx (4.24a) 

J — CO 



i:o 


iiKsniiooK or MrASURtMrvi scirsct 



{ifure4 lt> Cuuuan dmribution density 


1 or/(\) r- i' the moment ^f,ofn^hordc^lsobIaIncd Ofparliculanmportanct 
n the moment of first order, denoted as the linear phase space average x 

; = \(, = j,w = J x»(x)iix (•tyw 

and the squ.irc pliasc space aserape 

x-= - W, ^ £|x-) = J x-K<x)cJx (■!:■*<:) 

Accotdmph. the expectatjon value for ihe simultaneous occurrence of x,(i) 
and x.(f + t) IS obtained 

;(,ff)x.ff + :)- /:{x,f() x.(r -t i)| 
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If this is an ergodic event, that is, if the ergod ic theorem is satisfied, the phase 
space averages x" and time mean values x"(t) are 

oo 

£{x"} = x" = x"w(x) dx 
J — 00 

= x"(t) = lim f x%t) dl (4.25a) 

T-a>2/ J_T 

With equation (4.24d), a definition can be obtained of the correlation function 
which is based on the generalized phase space average: 

il/^y(T) = lim f x(t)yit + t) dt 

X-voo Zi J-T' 

= x(t)y(t + t) 


= x{t)y(t + t) 
• 4* CO /• + 00 


= f [ x(t)y(t + T)w[x(t), y{t + t)] dx dy (4.25b) 

For the particularly important Gaussian distribution density according to 
equation (4.23a), one calculates: 


Ml = X - x(l) 
M 


exp 


■Jlncr 


2(T^ 

— (x — ay 
2(7 


dx = fl (4.25c) 

2 . dx = + (7^ (4.25d) 


4.2.5.3 Practical investigations 

In order to register the probability distribution and density electronic majority 
decision elements having an adjustable threshold value x are used, as outlined 
in Figure 4.17. This can be done by either a triggering circuit or a voltage divider 
having a biased diode. With this, a normalization is to be carried out such that 
the corresponding conditions, equations (4.20c) and (4.2 Id), are observed. The 
arrangement is also suitable for recording on the oscilloscope screen, provided 
the voltage of the sweep generator for the x-deflection of the oscilloscope is used 
to control the threshold value x. The sweep frequency must be slow enough to 
ensure adequate averaging occurs. Oscilloscopes with long screen persistence 
times are used. 

By coupling several installations in accordance with Figure 4.17b, compound 
probability distributions Wfx, y) can also be recorded. For this purpose, the 
trigger outputs of one arrangement for each event x, y will be connected with an 
AND element and further processed as shown in Figure 4.17b (Woschni, 1968). 
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figure 4 17 (i) Registration of the probability distribution and (b) probability density 
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In practice it is possible to describe many events, at least approximately, by 
the Gaussian distribution of equation (4.23a). The probability for the fluctuation 
process to lie within the range — x < ^(0 < +x, or a — x ^ ^(t) < a + x, 
where o is a constant, is: 

rt-x « m < +>^3 “ tL 

To evaluate this the probability integral is used which is tabulated in the follow- 
ing form (Jahnke, 1960): 

(l)(x) = r exp(-«^)d^^ (4.26b) 

y / ^ Jo 

Extracts are shown in Table 4.3. 

It may have been found by measurement, for example, that the length of 
workpieces having an average value of u = 10 cm satisfies a Gaussian distri- 
bution and shows a standard deviation of c = 3 mm. What matters then might 
be the number of workpieces that lie within an admissible tolerance range of 
10 mm ± 4 mm. Evaluation according to equation (4.26) shows that 82% of 
the pieces are within the tolerance range, and that the remaining pieces lie outside 
this range. 


4.2.6 Geometrical Signal Representations 
4.2.6. 1 In Euclidean signal-space 

Signals with n components x,, . . . , x„ can be represented by a signal vector x in 
the n-dimensional space: 

X = (x„X 2 ,X 3 ,...,x„) (4.27a) 

where the end point of the vector determines the corresponding signal. 


Table 4.3 Gaussian probability integral (p{x) evaluated for 
a range of x 


-X 

0 

2 

4 

6 

8 

0.0 

0.0000 

0.0226 

0.0451 

0.0676 

0.0901 

0.1 

0.1125 

0.1348 

0.1569 

0.1790 

0.2009 

0.2 

0.2227 

0.2443 

0.2657 

0.2869 

0.3079 

0.3 

0.3286 

0.3491 

0.3694 

0.3893 

0.4090 

0.5 

0.5205 

0.5379 

0.5549 

0.5716 

0.5879 

0 

0.0000 

0.2227 

0.4284 

0.6039 

0.7421 

1 

0.8427 

0.9103 

0.9523 

0.9763 

0.9891 

2 

0.9953 

0.9981 

0.9993 

0.9998 

0.9999 
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For analog signals, use is made of Euclidean space, where the Pythagorean 
theorem applies Thus, for the magnitude of the vector, also called ‘norm’ m 
geometry, one obtains 


llxfl 



(4 27b) 


The various components are then the projections onto the different axes of the 
n-dimensional space, which is called signal space 

The notion of the distance d between two or more signals is of great practical 
importance This distance results as a norm of the difference between two signal 
vectors x and y as follows 

y) = II* - yll = ^ Z l-^v - (fi 27 c) 

The scalar product 


X y = Zx,y, 


(4 28a) 


can be used to write the angle a between the two vectors m the following way 


cos a = 


X y 
lixll llyll 


(428b) 


For continuous analog signals x(t), which are defined in the range a <b, 
one can accordingly indicate a norm 


lixll = 




(4 29a) 


Physically, it represents the square root of the energy of the signal A special 
Hilbert space is thus defined (Blumenihal, 1961) The distance between two 
signals corresponds to the root mean square error 


<J(x, y) = jx - y|| = - >(0P dtj (“• 29b) 

It is often used m measurement technology and cybernetics as a measure for 
the error (see Section 4 3 10) 


4 2 6 2 In Non Euclidean space Hamming distance 

The above representation m the Euclidean space is suitable for analog signals 
Use is made of a representation in non-Euchdean signal space for discrete 
signals whose importance is constantly increasing In this space, the distance 
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between two vectors x, y is defined to be the sum of the differences of the in- 
dividual components (Blumenthal, 1961) 


d{\, y) = ||x - y|| = Y l^v - yvl (4.30a) 

v=l 

according to the norm of this space: 

11x11 {4.30b) 

v= 1 

The most important discrete signals are binary signals, that is signals in which 
the individual components can assume the values 1 and 0 only. For such signals, 
the signal space constitutes a n-dimensional hypercube having the edge length 
1, where the various edges are occupied by possible signals only. 

Figure 4.18 shows such signal words with one, two, and three bits. Obviously, 
the representation in non-Euclidean space with the norm according to equation 
(4.30b) has the advantage that the distance d indicates the number of digits by 
which two signal words differ from each other. The similarly defined minimum 
distance in a signal alphabet is called the Hamming distance and constitutes an 
important characteristic value for the investigation into a system’s insensitivity 
to noise (Peterson, 1962). To investigate distances between signals, use is also 
made of distance matrices. In the analog-to-digital conversion of signals, a 
transform between the corresponding signal spaces takes place. 



Figure 4.18 Representation of a binary signal in the signal space: (a) one bit; (b) two 

bits; (c) three bits 
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4 2.6 3 Representation as a codegraph 

For an easily understandable explanation of the signal structure, the codegraph 
may be used Based on the results of the graph theory, the m possible states 
(sjmbols) of a signal are shown as an assembly according to Figure 4 19 
Figure 4 19b shows the application of this t^ of representation to a binary 
signal having three digits Where the individual code words are of the same 
length, they can be separated one from the other by counting No characters are 
required for the separation of the individual words, the code is irreducible 
Obviously, this is the case if the end points of the codegraph are occupied by 
code words only 


4.2.7 Typical Signals 

The essential properties of important signals are summarized in Table 4 4 
Individual signals have already been presented as examples m the relevant 
sections The harmonic oscillation, the unit impulse, and unit step arc typical 
lest signals for system identification White noise (wide band noise) has been 
included m the summary as the most important interfering signal For this 
signal, there is no amplitude density ds has already been discussed The 
relationships which have already been pointed out between time and frequency 
functions as Founer transforms become evident the narrower the time signals, 
the wider the band of the frequency signals with the extreme assignments 
between the constant behaviour in the frequency domain and the function 
behaviour m the time domain 
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Table 4.4 Properties of important signals 


Amplitude Autocorrela- 

Charac- Time function density Power density tion function 

teristic x{t) S^^{co) Remarks 
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Of great importance for measurement technology in particular are estimates 
and approximate considerations In contrast to other fields of information 
engineering, for instance control engineering, where input signals are given and 
output signals are to be calculated, m measuring technology the opposite has 
to be done For this reason, before choosing a suitable measuring device, it is 
necessary to make an assessment of the signal behaviour to be expected, on the 
basis of a prion information, and then choose the device To this end, it is usual 
to approximate the quantity to be measured by cither a pulse shaped or ramp- 
shaped signal Therefore, these two signals have also been included in Table 
4 4 The pulse duration and the duration of the ramp function, respectively, can 
be taken from data on the technological process to be investigated, for instance 
the speed m rotating machines According to the sampling theorem, a threshold 
frequency to, corresponds to signal duration 2Tj (refer to equation (4 11c) of 
Section 4224) 

to, = ir/2T, (4 30c) 

Above this frequency there are only spectral oscillations having a relatively low 
amplitude, the amplitude or power spectrum mainly lies below this threshold 
frequency (called band-limited signals) To avoid major measurement errors, 
It IS therefore sufficient for the measuring device to cover this frequency domain, 
I e 0 ^ tu > cu, (see also Section 43 7) 

43 SYSTEMS 
43.1 Classification of Systems 

According to Figure 4 20 a system can be interpreted as a ‘black box’ with a 
family of input variables x,, which can be regarded as a signal vector x, and a 
family of corresponding output variables y,, forming a vector y The interior of 
the ‘black box’ may consist of several elements either electrical, hydraulic, 
pneumatic or of other energy nature The overall behaviour of these systems is 
given by a mathematical equation of the following form 

y = 0{x} (4 31) 



Figure 4 20 Definition of a system 
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Different systems may have the same overall behaviour, that means the same 
relation (4.31). This means that a given system may be substituted by another 
one with the same mathematical equation, for example by a computer of either 
analog or digital form (an aspect of modelling). Because of the convenient 
adaptability of computers to the behaviour of a given system by means of 
programming such modelling methods play an important role in the field of 
cybernetics. Measurement as a part of cybernetics can be comprehended as a 
mapping of the input signals on the space of the output signals (Finkelstein, 
1976). 

Technical systems are characterized as those with active or only passive 
elements (so called ‘active’ or ‘passive systems’) or they are described by the 
number of ports (two-port, three-port, etc.). 

Table 4.5 gives a survey of the classification of technical systems. With respect 
to the difficulties for the treatment of a system it is of great importance to know 
if the system is linear or not, because in the linear case the superposition law is 
valid. In this chapter normally it is assumed that the system is a linear one. 
Methods of linearization will be treated in Section 4.3.2. Another typical charac- 
teristic of a system is whether the parameters describing the behaviour are 
functions of time or not. Furthermore most of the systems used in cybernetics 
are so called unidirectional systems, that means that the parameters of the system 
are independent of those of the following system. We will only deal with systems 
that fulfill this assumption. Otherwise the results of four-pole theory must be 
applied (Feldtkeller, 1962). 


Table 4.5 Classification of technical systems 


System 

Linear system 

Non-linear system 

Mark 

Parameters are constant, 
independent of amplitude. 
Superposition law is valid 

Parameters are a function 
of amplitude. 
Superposition law is not 
valid 

Parameters are time 
invariant 

Examples: 

most of measurement 
systems with small input 
levels: amplifiers; filters; 
transducers 

Examples: 

systems with large input 
levels: output amplifiers; 
driver stages. Often 
linearization is possible 

Parameters are functions of 
time 

Examples: 

controlled amplifiers with 
multiplicative properties and 
small input levels: 
modulators; frequency 
multipliers; parametric 
amplifiers 

Examples: 

controlled amplifiers with 
multiplicative properties 
and large input levels: 
modulators, frequency 
modulators; frequency 
multipliers 
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Figuic 4 21 Spring- 
mass-damper system 


As an example of great practical importance in the field of measurement the 
spring-mass-damper system, as illustrated in Figure 4 21, will be considered 
This system is used in numerous sensors, for, among others, the measurement of 
length or force If force F is the input variable x, and length s the output variable 
y, the following special equation corresponding to the general relation (4 31) is 
obtained 

my + ky + cy = X (4 32) 

This second-order differential equation describes the system’s dynamic 
behaviour 


43 2 Modelling and Linearization 
4 3 2 1 General remarks 

As mentioned above, systems with different intenor elements or form of energy 
can have the same mathematical relationship between output and input 
variables From this fact it follows that a given system can be represented by 
another system having the same overall behaviour This modelling has the 
advantage that with the model system ifie parameters and structures may be 
changed easily by programming a computer Furthermore it is possible to 
observe the input and output quantities in a convenient way by means of 
oscilloscopes or plotters and to change the scale of coordinates or time axes 
Important methods of modelling are analogies, application of block diagrams, 
and linearization 


4 322 Analogies 

Of great importance is the fact that mechanical systems, in the same way as 
pneumatic or hydraulic and other systems, can be presented by electrical 
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(c) 



Figure 4.22 Examples from mechanical and electrical systems; 
(a) Translation; (b) rotation; (c) capacity; (d) analog computer 


systems, as shown in Figure 4.22. In the cases illustrated in this arrangement the 
following equations are valid; 

(a) In the mechanical example (mass m, velocity y, force F) 

V — — Tf dt (4.33a) 

m J 

(b) In the case of rotatory motion (moment of inertia ©, angular velocity co, 
torque M) 


CO = J- I'm dt 

0 J 

(c) In the electrical system (voltage u, capacitance C, current i) 

(d) In the general case of an analog computer (constant c) 

y = c Jx dt 


(4.33b) 


(4.33c) 


(4.33d) 


Generalizing the dependences illustrated in Figure 4.22 realizes the survey of 
analogies between electrical and mechanical systems shown in Table 4.6. For 
more detail refer to Olson (1943), Koenig and Blackwell (1961), and Reichardt 
(1960). For the spring-mass-damper system with differential equation (4.32) 
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Table 46 Suncy of electromechanical analogies 


Electncal 

system 


Mechanical system 


Translation 

Rotation 

direct 

indirect 

direct 

indnect 

i 

F 

V 

Af 

at 

u 

• 

F 

O) 

Af 


I 

k 

1 



k 


k 


C 

m 

1 

B 

1 

e 

L 

\ 

C 

1 

c 


C 


C 



and the representation of Figure 4 21 the electrical models shown in Figure 
423 are obtained Direct analogy (see Table 4 6) yields the parallel circuit 
Figure 4 23a with the equation 

while indirect analogy provides the series circuit given in Figure 4 23b 

“ Ji dt + K( + L ^ =s H (4 34b) 



Figure 4 23 Electncal model of the spnng-mass'damper system of Figure 4 21 
(a) parallel circuit, (b) senes circuit 
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Table 4.7 Symbols for block diagrams 


Function 

Branch 

Summation 

Subtraction 

Sign 

inversion 

General 

system 

Symbol 

X 



Jt 'I'Q" 



Equation 

y, = Vz = X 

y = X, + X2 

y = X, - X2 

y = -X 

y 

= fix, 0 

Function 

Constant 

factor 

Integrator 

Non-linear 

system 

Root 

calculation 

Multiplier 

Symbol 



D 


X 

? 


Equation 

y = kx 

y = Jx dt 

y = f(x) 

II 

y 

— Xi • X 2 


As can be seen from a comparison of Figures 4.21 and 4.23 the indirect analogy 
is more convenient because a mechanical parallel circuit is modelled as an 
electrical parallel circuit. 

4.3.2.3 Block diagrams 

Measurement systems are built up from particular subsystems. Therefore it is 
suitable to represent each subsystem by a block including a symbol indicating 
the operation the subsystem has to realize. Table 4.7 contains some of these 
symbols and signs used for the demonstration of the interconnections between 
the systems. (It must be pointed out that many standards for such are in Use.) 
It is supposed that the systems are unidirectional. Otherwise it is possible to 
describe the behaviour of such a system by means of an interconnection between 
several unidirectional systems. Figure 4.24 illustrates three typical methods of 
interconnection of systems. The output-input relation of a system is given by 
the equation 

y = Gx (4.35) 

The frequency response for an equivalent system having the same overall 
behaviour to that of the interconnection of the subsystems shown is given by; 

(a) scries circuit (Figure 4.24a) 


Ge = n G. 


(4.36a) 
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(b) 

parallel circuit (Figure 4 24b) 



Cc=J:g, 

(4 36b) 

(c) 

m opposition (Figure 4 24c) 



- — ^ — feedforward, oscillator 

1 — OjOj 

(4 36c) 


o, = 



■; Jr-TT negative feedback, control 

1 + G,Gj 

(436d) 



Figure 424 Typical system connections (a) series circuit, (b) parallel circuit, 
(c) connection in opposition 
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The product in equations (4.36c,d) describes the frequency response of the 
open control loop. 


4.3,2.4 Linearization 

Linear systems are distinguished by the validity of the superposition law. Non- 
linear systems are often linearized to enable the advantages of linear systems 
to be used. The following preliminary conditions must be fulfilled in such a 
strategy: 

(a) only small deviations of the characteristics from the linear course can be 
used; 

(b) only a relatively small drive range of the non-linear characteristic can be 
tolerated. 


For the linearization the Taylor series expansion of the non-linear characteristic 
y = f(x) at the working point = /(xq) is employed. Writing only the 
deviations from the working point Ax, Ay yields 


y - yo = Ay 


dx 


3 15V 

Ax -f- -~ 

A 2 15V 

Ax “T — T 

xo 2 dx^ 

xo 6 8x^ 


Ax^ + 


(4.37a) 


Ixo 


In practical measurement technique the input variable is often the sinusoidal 
function x = sin(caf). The proportion of the dominant wave with frequency 
CO at the output expressed as a ratio with the amplitude of the input j?, gives 


1 5x,„V 8 5//axU / 


(4.37b) 


This function is the describing function ; it expresses the frequency response of a 
non-linear system. Depending on the sign of the third-order differential co- 
efficient, the describing function either increases or decreases with the square of 
the amplitude of the input as Figure 4.25 shows. In practice the case 
d^fjdx^ > 0 can be troublesome because the amplification factor is increasing 
with increasing amplitude leading to an unstable oscillating regime (Woschni, 
1973). 

Important in the field of measurement is the rectification effect using a square- 
wave characteristic 


4 dx^ 




(4.37c) 


In this case distortion appears that is described by the distortion factors 


^3 



1 /a^//ax^u \ 

4V3//5xuJ ^ 



(4.37d) 


(4.37e) 
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Figure 4 25 Ratio of the amplitude of the dominant wave of the 
output to the amplitude of the input X (describing function) 


4 Systems in the Time Domain 


4 3 3 1 Description by differential equations 

The oldest method used in solving problems of system analysis is the method of 
diHerential equations A linear system is described by a linear differential 
equation of the following form 


any"' + 


+ 0,/" 


hi 

= GqX 4- — X + 
«o 


-f- fl2>’ + aiy + Ooy 


= boX + hjX + + 

This equation may be written 

+T'y'^ + 


(4 38a) 


+ Tly + T^y + y 




(4 38b) 


where G© is the static sensitivity 

ii„ _ 

Oq — ~ 

Oq Ax 


(4 38c) 


Go can be measured as an amplification factor by means of a small alteration 
Ax as stated in equation (4 38c) The coefficients 


(4 38d) 
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are time constants. For a differential equation of nth order n time constants are 
defined, the greatest of which is used to estimate the duration of the transient 
function. The solution of the differential equation consists of two additive 
components, a stationary and a dynamic portion : 

y(0 = Ts, + yd(0 (4.39a) 

To solve the homogeneous differential equation leading to the dynamic portion 
the assumption 

yd = C eP' (4.39b) 

is used. The zero points of the characteristic equation 

TUp" + T”: JpP-i + ■ • • + Tlp^ + TiP + 1 = 0 (4.39c) 

signify whether the solution is stable or not. In particular these so called eigen- 
values p, prove that 

(a) if the real part is less than zero, i.e. Re(pr) < 0, a stable solution exists; 

(b) if the real part is greater than zero, i.e. Re(p,) > 0, an unstable solution 
exists; 

(c) if the eigenvalues are complex, oscillations with decreasing or increasing 
amplitudes exist. 


With the eigenvalues p^ the dynamic solution yields (Coddington and Levinson, 
1955) 

n 

Z<^rexp(p,t) 

r= 1 

(4.39d) 

If a double root po arises 


^d = (Cl -f C2t)exp(pof) 

(4.39e) 


The stationary solution y^, is to be found by means of suitable terms satisfying 
the inhomogeneous differential equation (4.38b). 

As may be seen from equation (4.39d) the eigenvalue p^ corresponds to a time 
constant % = 1/p^, the greatest value of which is responsible for the 

duration of the transient process. Because e'^=^4 5% the transient process 
approximately continues and 

hr = (4.39f) 

where is the transient time. 

4.3.3. 2 Basics of state space description 

The basis of the state space description is the classical differential equation 
discussed above. The state of a system is described by a set of state variables y^. 
The number of these state variables agrees with the degree of the differential 
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equation Today the method is of growing importance because of its aptitude 
for computer simulation of systems The nth-order differential equation given 
for computer simulation of systems 

The nth order differential equation given by (4 38b) may be transformed into 
a system of n differential equations of first order 

> =>i 

3 = >2 = )l 

} = 33 = >2 


(440a) 


3*' " = 3, = 3- 1 

_ _ 1 _ T, “Tl _ _ t;~} J_ 


This system of equations can be written in the form of a vector differential 
equation (2^deh and Desoer, 1963) 
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Tn. 


(440b) 

or with the abridgements j, T, b and x 

J = Ty + bx (4 40c) 

Consider as an example the spnng-mass-damper system of Figure 4 21 with 
the differential equation (4 32) The vector equation is given by 


Tv.T 

■ 0 1 ■ 


0 

r 

c k I 


1 

L32J 

m mj 

L32j 



The state variables are the displacement > = jj of the mass and the velocity 
3=32 Figures 4 26 and 4 27 respectively show the programming of analog and 
digital computers for modelling this system The relationships between pro- 
gramming and state space descnption are easy to recognize 


4 3 3 3 Step response function, pulse response function 

For system description and identification certain response (output, ansHer) 
functions for test signals at the input are generally used These are presented 
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here as time-dependent functions but, as is to be seen in Section 6.3.2, they can 
be used with any general variable x, spatial distribution being one used in optical 
systems. 

(a) The response to a step function with a step amplitude of 1 (unit step func- 
tion, vv(t)) is the unit step response or transient response h(t) as illustrated 
in Figure 4.28a. 

(b) The response to a pulse function with an integral value of 1 (Dirac delta 
function, S(t)) yields the unit pulse response or weighting function g(t) 
shown in Figure 4.28b. 

Because of the linearity of the system the response function y(f) is to be divided 
by the step amplitude or the integral value respectively to get the normalized 
function. In the case of system identification if the input function generated by a 
signal generator is not the ideal function but a function with a rise time At as 
signified in Figure 4.28, the condition must be fulfilled that the rise time At, or 
the pulse width At, are very much smaller than the transient time t,,. of the system 
(Woschni, 1973). 

As can be seen from the comparison of parts of Figure 4.28 the pulse function 
is connected with the step function by means of a differentiation, that is, in the 
sense of distribution (Gelfand and Schilow, 1960), 

dcu(t)/d/ = 5(0 (4.41a) 

For linear systems it is immaterial whether the differentiation is realized at the 
input or output side of the system, that means 

^'(0=j9(0df (4.41b) 

It is only a matter of suitability, whether the transient or weighting functions are 
used for identification. For a system with first-order delay, as is used for the 
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Figure 427 Flow chart for the programming 
of a spnng-mass-damper s>stem on a digitial 
computer 


approximation of several systems m measurement technology, tve gain the 
differential equation 

Ti}+y = x (44Ic) 


with eigenvalue 


Pi = -1/7; 


{441d) 
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Figure 4.28 (a) Unit step tv(t)and transient response l!(f); 
(b) delta function 5(t) and weighting function g{t) 


The stationary solution may be found by means of the assumption 

Tst = C2 

and the total solution yields 

3’ = = C 2 + Cl exp(-t/Ti) 

Using the boundary conditions 


3'lr=-oo = 0 yl,= a> = l 



142 


HANDBOOK OF MEASUREMENT SaENCE 



Figure 4 29 Transient response and weighting function for a 
first*order system 


we get for the transient response Ht) 

/.(t) = I - exp(-r/r,) (441c) 

The weighting function g(c) follows from equation (4 4lb) 

»(') = = Y ~ > (■' ■’'fl 

The functions h(t) and g{t) are represented in Figure 4 29 If these functions are 
obtained experimentally by means of a function generator at the input of the 
system examined the time constant T, is gisen by the length of the subtangent 
(Figure 4 29) Furthermore the figure shows the transient time to be nominally 
three times the time constant T, for e For more details referring to 

testing of systems and important examples see Sections 43 8 and 43 9 


4 3 3 4 Generalized response functions coniolution 

Transient response and weighting functions are response functions to special 
input signals In the general case the input function is broken down into a series 
of weighted Dirac delta functions, which are time-delayed as represented in 
Figure 4 30 The pulse at the time t, yields the output 


9(t, - t,) 
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Figure 4.30 Explanation of the conxolution integral 


Becauxe of the linearity of the system the superposition law is valid, that means 
the entire output is the sum (integral) of all inputs at the time / - t, > 0: 

.riO = f .x(r)7(r - T)dr 

v'O 

= f A.{f - ■c)q{-) dr 


= •'■(0 * fir (0 


(4.42a) 
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This IS the convolution integral, denoted by the sign • for the convolution The 
lower limit of the integral may be extended to infinity since the weighting 
function must be zero before the input is applied Since the input is zero for 
r < 0 the upper limit may also be extended to infinity, this means equation 
(4 42a) may also be written 

>(t) = J x(t)g(r - t) dr 

= J x(r-r)3(r)dT (442b) 

Another form of the convolution integral, the Duhamel integral, is obtained by 
taking into consideration equation {44Ib) 

>(() = ^ J - r) dt = i J x(r - t)Ki) = WO • '>(0] (‘•‘12c) 

In this equation the upper and lower limits may also be extended to infinity 
In the field of measurement convolution is of great importance for system 
identification (Davies, 1970) The autocorrelation function at the output of a 
system and the input are related by a double convolution (Woschni, 1981) 

W')=f C + 'i - '2Wri)9(^2)<l^i <•'! (‘*‘•30 

Jo Jo 

If the autocorrelation function of the input ^„(t) and the cross-correlation 
function T,/t) are measured the weighting function of the system can be 
calculated, for which the following relation is valid (Davies, 1970) 

'l'„M = f 9(0l('„(2 - 0 dl ('• ‘•31>) 

Jo 

Deconvoluting equation (4 43b) gives the required pulse response function 
g({), the system may be regarded as being identified Direct deconvolution 
techniques using equation (4 43b) can be difficult Therefore methods m the 
frequency domain were developed as shown in the following sections In the 
special case of white noise with an autocorrelation function — 0 = 
2;r5o5(T — t) the deconvolution degenerates to the equation 

= 27tSo9(T) (4430 

This means that the weighting function corresponds directly to the cross- 
correlation function (Woschni, 1973) 
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4.3.4 Systems in the Frequency Domain 


4.3.4.] Frequency response: logarithmic characteristics 

Using a sinusoidal input and taking up both amplitude of the output ? normal- 
ized to the input X and phase <p as a function of frequency cu provides the fre- 


quency response for which 

amplitude characteristic is I G(jcu) I = (4.44a) 

phase characteristic is <^( 0 ;) ^ y,x (4.44b) 

In the complex presentation the input x = 6^“' yields the output y=f 

= f e^’’ e^®", that means the complex frequency response is 

f 

j eJ^ = G(}(o) = I G( jro) 1 (4.44c) 


This function is represented in the complex plane as a locus diagram. The com- 
plex frequency response may be split into a real and an imaginary part: 


G(jcu) = Pico) + je(ft)) (4.44d) 


with the relations 


|G(ja;)| = ^lP\co) + Q\co)-\ (4.44e) 

(Pico) = arctan^ll^j (4.44f) 


Figure 4.31 explains the representation of the frequency characteristics. If the 
differential equation is given it is very convenient to obtain the frequency re- 
sponse by means of the terms 


x*"’ = (jcu)";^ ej“"; y'''> = (jco)"f e^’’ e^"' (4.45a) 


Substituting the differential equation (4.38b) and solving the output-input 
relation yields the complex frequency response 


G(ia)') = ^ = ^0 + i^bjap) -f • • • -f (jaj)’"(fc Jup) 

~Y' 1 +..; + (jcu)"T;: 


(4.45b) 


This implies that substitution of nth-order differentiation by (joj)" and nth- 
order integration by (1/jco)" will yield the output-input relation. 

The frequency response can be measured, by way of equations (4.44a, b), 
using a sinusoidal test signal. If the input is a stochastic signal with power 
density S^,(tu) the output power density SyJipS) may be calculated according 
to (Davies, 1970) 


S„,(m) = jG(jcu)pS„(cu) 


(4.45c) 
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The double convolution (equation (4.43a)) in the time domain corresponds with 
a multiplication with |G(jaj)|^ in the frequency domain (Woschni, 1973). 

Of great practical importance is the plotting of the amplitude characteristic 
in a double-logarithmic calibration graph and the phase characteristic with a 
linear (p-axis and a logarithmic co-axis, known as logarithmic frequency 
characteristics. The amplitude is generally measured in decibel (dB) units, 

20 log(?/l) = 20 log I G(jco) I (4.45d) 

in order that a linear scale can be used for the y-axis (Bode diagram). The 
advantage of this method is illustrated in Figure 4.32. As treated in Section 4.3.2.3 
the overall behaviour of series-connected systems is given by the multiplication 
of the frequency responses of these systems (equation (4.36a)). Because of the 
logarithmic representation the multiplication is simplified to a summation, 
which can easily be realized graphically. 

For a system with first-order delay, used for approximation of more compli- 
cated systems, from the differential equation (4.41c) of Section 4.3.3.3 we get 

G(jcc>) = 1/(1 + jtoTj) (4.46a) 

|G(ja))| = l/vTWrf (4.46b) 

(p(a)) = — arctan(cuTi) (4.46c) 

The frequency response functions are featured in Figure 4.33. From Figure 
4.33b results a critical frequency (or co^) given by 

2nf, = l/T, (4.46d) 

used for approximations (Woschni, 1973). 

Important examples of measurement systems are discussed in Section 4.3.9. 

4.3.4.2 Transfer function 

A generalization of the complex frequency response arises if the frequency jcu 
is extended to the complex frequency 

p = jco + d (4.47a) 

with the increase constant d. 

This complex frequency p is the same as the variable p (or s in the mathe- 
matical literature) used with Laplace transformation (see Section 4.3.5). In 
the physical sense p means a harmonic oscillation with an exponential increasing 
or decreasing amplitude (Woschni, 1973): 

x = = X e"' ej“' (4.47b) 

If p is represented in the p-plane the left-hand side of this plane signifies stable 
solutions ((5 < 0) while the right-hand side leads to unstable solutions (<5 > 0). 
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Figure 4.33 (a) Locus diagram; (b) amplitude characteristic and (c) phase 
characteristic for a first-order system 
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This criterion is used for assessing the stability of systems as treated in Section 
4 3 6 The complex frequency p used instead of (o results in the transfer function 
G(p) for the system described by equations (4 38b) or (4 45b) 


+ pr^ o—p' 


The G(p) plane is a conformal mapping of the p plane (Zadeh and Desoer 
1963) that means the side directions of the curves remain valid as the next 
example shows The system with first order delay with the frequency responses 
(4 46a b c) has the transfer function 


given jn Figure 4 34 with seteral values of ^ It can be observed that Figure 4 33 
is a special case included in Figure 4 34 


4 34 3 Pole zero configuration 


By means of searching the zero points of both the numerator p* and the divisor 
p„ of the fraction (4 47c) one gams the equivalent product representation the 
polynomial equation alternative of equation (4 47c) 


/■( ' - p* xp ~ p*) (p - ■ nr 1 (p - pt) 

(p - ptXp -pi) (p-p.) m >(p-pj 


(4 48a) 



Figure 4 34 Representations of the transfer function for a first-order system 
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having the poles and the zeros p* of the transfer function. Therefore the 
properties of a system are described completely, up to a constant c, by the 
position of poles and zeros, represented in the pole-zero plane (Figure 4.35). 
The poles agree with the eigenvalues of the differential equation. 

The frequency response can be calculated from the pole-zero plane by 


G(j(u) = c 


Ylr-l - Pr) 

nr.=i 0" - p^) 


(4.48b) 


Figure 4.35 shows how, for series connections, the pole-zero representation of a 
complicated system can be split into a sum of simpler systems. Poles and 
zeros at the same point cancel each other; this feature is used for the correction 
of systems by means of additional series-connected correcting elements (Section 
4.3.10). The position of the poles contains information as to whether the system 
is stable or not: existence of poles in the right half-plane signifies instability 



-j t -J f 

Figure 4.35 (a) Splitting up the pole-zero plane of a system; (b, c) into series- 
connected subsystems (x poles; O zeros) 
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(Section 4 3 6) In measurement the phase shift bridge (Figure 4 36a) is often 
used It has the transfer function 


G(P)=\ 


1 - pCR 
I + pCR 


and the pole and zero at 

1_ * _ J_ 


The pole-zero plane (Figure 4 36b) shows a symmetrical position of pole and 
zero referred to the imaginary axis This configuration of poles and zeros is 
typically for all-pass circuits having a constant amplitude charactenstic and a 
frequency-dependent phase characteristic as follows directly from equation 
(4 48b) All-pass systems play an important role for the correction of the phase 
characteristic (Woschni, 1981) 

Every system containing zeros in the right half-plane can be split into an all- 
pass and a so called minimal phase system without zeros m the right half- 
plane, as shown in Figure 4 37 for the system represented in Figure 4 35c 


4 J.5 Relevance of Time and Frequency Domains and Transforms Between 
Both: Laplace Transform 

In Section 4 24, especially Table 4 2, the relationships between time and 
frequency presentation of signals are treated In a similar way the relations 
may be described in systems 


(b) 


+] 



Figure 4 36 Phase shift bridge (a)arcuit,(b)pole zero plane diagram 




Tablc48 Laplace Irjnsformsof lime dependent functions(Woschni and Kraus, 1976) 



(jB)iIsoo - {jq)qso3 
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The basic idea of the calculation of the tunc functions, i e , g{t) or is as 
follows Both the spectral function ^(jto) of the input and the frequency re 
sponse of the system G(ja)) or generalized) G(p) are given It is then possible to 
derive the output caused by any of the several sinusoidal input spectral oscilla- 
tions by multiplying the complex spectral density Jf’(jto) with the frequency 
response then applying summation (integration) of all frequency components 
This IS valid because the superposition law applies In system theory the Laplace 
transform is preferred to the Fourier transform because it converges more 
quickly It can be derived by substituting jcu -* p in the Fourier transform 
equation (4 28) 


f(p)= r/(()e-'”dl = L(/(r)) 

Jo 

(4 49a) 

m = 2^ £ “°F(p)o'“ dp = Z. ' {f (p)) 

(449b) 


In system theory this so called one sided Laplace transform is made use of, 
for only the region t > 0 is interesting For solving optical problems the two- 
dimensional Laplace transform or Fourier transform is applied (Goodman, 
1968) The convergence abscissa c in equation (4 49b) is chosen in such a way 
that the poles remain to the left of this abscissa Due to the residue theorem 
(Kaplan, 1962), 

£ PWe" dp = 2pj Y. Res(p,) (4 50a) 

and for a pole of nth order 


Resfpo) = - — hm [F(p)e'’'(p - Po)"] (** ^^b) 

(n - l)\^p„dp 

Table 4 8 provides (and later in Figure 6 2) Laplace transforms for commonly 
met time functions Table 4 9 gives the Laplace transforms of test signals 
(Woschni and Kraus, 1976) Table 4 10 surveys theorems of the Laplace trans- 
form Physical considerations lead to the following relationships between out- 
put and input (Woschni, 1973) 

L{j(t)} = L{x(0)C(p) ,(0 = t'‘{L{x(I))G(p)} (4 51a) 


This equation yields for the transfer funrdion G(p) 


G(p) = 


LWO) 


y(p) 

X(P) 


(4 51b) 
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(continued) 





SIGNALS AND SYSTEMS IN THE TIME AND FREQUENCY DOMAINS 


159 


Table 4.10 Theorems of Laplace transform 


Addition theorem 


L{m + MO) = L{A(t)} + 


Multiplication theorem 


L{afit)} = aL{fit)} 


Shifting theorem 


for a > 0 L{/(f - a)} = e“'’“F(p) 


or L{/(£ + a)} 






Likeness theorem 


Attenuation theorem 


iffl>0 


L{e “'/(£)} = F(p + a) if Re(p + a) > Po > 0 


Limit theorem 


lim fit) = lim pFip); lim /(f) = lim pFip) 

l“*00 p-*0 I-*0 P*-*CD 


Integration theorem 


:.| J^/(T) drj = ^ L{/(t)} if Re(p) > 0 


Differentiation theorem 


L{/'">(t)} = p"L{/(f)} - p'-'/( + 0) /'"-‘>(+0), 


if the limits 

lim fit) = /(+0); lim /(f) = /(+0); . . . ; 
r-»0 f-»0 

lim /<"-»(£) = /'-'-D(+o), exist 

I-O 

(9)-(10) Convolution theorem 

If the integrals J e~ *’'/,(/) df and J e“ '"/aCO df both are absolutely convergent 
or at most one absolutely and the other conditionally convergent yields 

L{m)L{f2it)) = L{hit)*m) 


flit) * / 2 (f) = f /i(T)/j(f - t) dt = { flit- t)/ 2 (t) dr 
•>0 4o 



160 


HANDBOOK OF l.tEASUREMENT SCIENCE 


Using the Laplace transforms of ihe unit step and the Dirac function (Tables 
4 8 and 4 9) provides, instead of equation (4 5Ib), 

or 

G(P) = = pLm) = p J"(,(t)e it (4 51d) 

and for the calculation of the time functions 


I i^+j® 

g(t) = L G{p)e^dp, 

2JtJ Jf-,* 


From equation (4 51a) it follows that 


><t) = L- ' {L{xU)}L{gm = £ 41 ) 9(1 - I) dr = j(l) . 9(1) (4 51g) 


><0 = L-'{L{x(t)]pLm)) = ^ £j.(r)ft(r - r) dt = | WO • KO] 

(4 51h) 

giving the convolution theorem of the Laplace transform (Table 4 10) Figure 
4 38 demonstrates this fact By means of these relations the deconvolution 
problem may be solved (Davies, 1970) 

40 = 1. ‘|LWO}^| = £}<t)g‘(t-T)dt = x(O*9*(0 (4510 

With 

The deconvolution often becomes extremely difficult because it is generally 
not possible to realize the inverse system functions An alternative approach to 
the solution, by means of Fourier transforms, is often used (Davies, 1970, see 
also Section 4 3 34) 
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x(f) Time domain 

1 t 

L L-' 

I 

>_. G{p) — >- y(p) Frequency domain 

Figure 4.38 Convolution relationships 


4.3.6 Stability 

A system is stable if one of the following conditions is met: 

(i) The eigenvalues of the differential equation exhibit a real part less 
than zero, i.e. 

Re(p,) = 5, < 0 (4.52a) 

(ii) In the right-hand side of the pole-zero plot there are 

no poles of the transfer function ; (4.52b) 

(iii) Decreasing eigenfunctions occur; this means that the weighting function 
git) fulfils the condition 


lim f \git) \ dt ^ M < CO (4.52c) 

C-* CO •/o 

For the examination of these stability conditions certain stability criteria can be 
applied to test the situation: 


(a) If the following differential equation is given 

+ ■■■ + aoy = boX + ■•■ + 

then the Hurwitz-Routh criterion can be used to test the stability. All 
coefficients a^, and the determinants 


Oj Qq 0 • • • 0 

fla 02 ^1 ^0 0 


(4.52d) 


|a2^_I 02 h -2 ■■■ 

have to be positive, i.e. > 0, > 0. 

(b) For practical applications graphical methods based on the locus diagram 
representations are of value. The transfer function G(p) consists of poly- 
nomials in the numerator Nip) and the divisor Dip): 


Gip) = Nip)/Dip) 


(4.53a) 
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Application of the stability condition (4 52b) shows that 

the polynomial of the divisor D(p) is not permitted to have zeros 

in the right half-plane including those lying upon the imaginary axis (4 53b) 

(stability limit) 

For testing the stability of a given system the divisor of the frequency response, 
le D{]co), IS drawn on the complex plane diagram) forming a closed 

curve by inclusion of negative frequencies Because of the conformal mapping 
the unstable field is always that lying on the right-hand side of the locus curve 
drawn from w = — cotO(u = -hoo (see Figure 4 39) To test for stability it is 
necessary to verify where the zero point is situated 

If the zero point of the D(p)-plane remains left of the locus curve 
D(jto), passed through from <t) = --ootoa) = +co, the system is (4 53b) 
stable If not it is unstable (Figure 4 40) 

The procedure is also suitable for the statement of the stability margin (see 
Woschm, 1973 or Zadeh and Desoer, 1963) 

As an example of a commonly met system in instrument systems consider the 
feedback system shown in Figure 4 24c For negative feedback (control) the 
frequency response is given by equation (4 36d) of Section 4 3 23 

r /, \ <^i(JQ>) 

‘ ^ I + G,(ja>)G2(ja)) 

The D{}oj) function is 

1 + 0,(ja>)C2(ja)) (4 53c) 


1" 

”1 


S<0 0 




Figure 4 39 Locus diagram of the divisor D(p) 
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Figure 4.40 (a) Unstable system; (b) stable system 


Instead of testing this function it is more convenient to verify that the open- 
loop frequency response 

Gi(ja))G2(jo)) (4.53d) 

fulfils the condition related to the point ‘ — 1’ as shown in Figure 4.41 (Nyquist 
diagram). The diagram illustrates that an increasing amplification will take the 
system from a stable to an unstable state. This fact may be caused by a non- 
linearity and may lead to unstable oscillatory behaviour, (see Section 4.3.2.4). 



Figure 4.41 Testing stability of closed-loop systems 
(control) 
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Structure-stable or structure-unstable systems may be distinguished (Tou, 
1964, Newton et al , 1957) 

43.7 Approximations 

As pro\ed already in Section 4 2 7, in the field of measurement approximations 
are of great importance because of the small a priori information of the signals 
to be measured For the descnption of the behaviour of systems it is usual and 
useful to introduce characteristic values which can be gained by means of 
approximations from the functions tn the frequency and time domains (see 
Figures 4 42 and 4 43) The amplitude-frequency characteristic (Figure 4 42) 
yields the upper and lower critical frequcnacs,^ „ and /« , or cOj^u and ,, 
where jG(jco)j decreases to ^2 = 07^3 dB of the reference value From the 
transient response /i(t) (Figure 4 43) the important transient time f,„ the dead 
time fd . the delay lime t, , t he compensation time r* , and the maximum overshoot 
Axo are obtained The transient time is approximately (see Section 4 3 3 1) 

h, = (4 54a) 

and from the sampling theorem the transient time and the cntical frequency 
are related by 

/c u = l/2tu (4 54b) 

In measurement systems engineering these approximate considerations are of 
great importance, for example as in the selection of suitable measuring instru- 
ments Qmsider the following example Figure 4 44 illustrates the problem of 
measuring a pulse-shaped input function, an approximation for numerous 



Figure 4 42 Definition ofthe cntical frequencies 
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Figure 4.43 Definition of the characteristic values in the time 
domain 


measuring tasks (Woschni, 1972a). In this figure three cases are presented. The 
long dashes represent the example in which the transient time is equal to the 
pulse width, i.e. = AT. The pulse height is still correctly indicated, whereas the 
pulse shape is strongly distorted. In contrast to this, considerable error is intro- 
duced when measuring the pulse height if the transient time is too long (short 
dashes). To permit proper determination of the pulse shape, the transient time 
has to be substantially shorter than the pulse width (chain curve). The same 
considerations may also be useful for analysing measuring errors. If one expects, 
for instance, a pulse-shaped behaviour of the output and receives, as the re- 
sponse, an output variable with a heavily prolonged trailing edge, then an error 
will be present (short dashes in Figure 4.44). It is noteworthy that minor and 
medium errors in amplitude measurement frequently produce more detri- 
mental effects in practice than the very large measuring errors. For errors of 



Figure 4.44 Approximation of the output for the case of 
a pulse-shaped input 
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some 10% in fields, the machine-made arcmt element that has been sized 
according to this measurement will function for a certain time due to the 
safety margins, and especially it will withstand the initial tests After having 
been produced m series and operated for some time, however, all elements fail 
at the same point according to the fatigue curve for the number of stress re- 
versals possible up to the failure of the element On the other hand, major 
measuring errors will mostly become evident during testing 

4 J.8 Testing of Systems 

In Section 4 1, especially Figure 4 2, the typical problems of measurements are 
treated Testing of systems is the task of system identification 
Because of the relevance of time and frequency domains it is possible to 
calculate either of these functions if the other one is measured Therefore, it is 
the availability of equipment that decides which of these characteristic functions 
IS to be found 

Testing of systems is performed by means of lest signals produced by a test 
generator, recording the corresponding output signals as illustrated m Figure 
4 45 Table 4 1 1 gives a survey of the test signals used, the output signals, and 
the characteristic values used for approximations Today the process of ob 
taming the characteristic functions is often automated, making use of pro- 
grammable function generators that are controlled by microcomputers, to 
form the appropriate input signals 

In particular the frequency response GOo) is obtained by measuring both 
phase angle and proportion of the amplitudes of output to a given sinusoidal 
input Before taking the true values it is necessary to wait until the steady state 
solution appears, le r > r,, = 1/(2X) If the measuring device itself has a 
non-ideal frequency response Gj,(ja>), G^Ocu) the real frequency response G(jcd) 
may be calculated from the wrong value G*{jcu) by means of the relation 
(Woschm, 1972a) 

G(ja,) = C*(j<a)5^ (4 55a) 

G,(jco) 

Othere principles of calibration make use of comparison systems or reciprocity 
principles for systems with reversible operation (Woschm, i972a) 

If the characteristic functions in the time domain, g(t) of /i(t), are to be found 
a problem arises in that the input signals are not the ideal ones, as shown m 



Figure 4 45 Testing of systems 
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Table 4.1 1 Survey of test signals 


Characteristic 

function 

Input test signal 

Output function 

Characteristic 

values 

Differential 

equation 

Not specified 

Not specified 

Time constants 

T, 

In the frequency 

Harmonic oscillation 

Frequency response 

Critical 

domain 

X = 

<7(ja>); 

Transfer function G(p) 

frequency 
/cl Wc 

In the time 

Unit step function u’(t) 

Transient response h(t) 

Transient 

domain 

Unit pulse = Dirac 
delta function S(,t) 

Weighting function g{t) 

time t , 
dead time 
delay time tj 
compensating 
time 

overshoot Axq 

Stochastic 

White noise 

Cross-correlation 

Transient time 

functions 


function 

^tr 


S((o) — constant 


Correlation 

time 


Figure 4.46 for a non-ideal step function Instead of the real transient 
response h(t), h*(t) is used ; 


^■*(0 = T f 'v*(T)/i(t — t) di (4.55b) 

(It Jo 

By means of a deconvolution it is possible to gain the real transient response 
where w*{t) is known (Woschni, 1972a). In practice it is usually sufficient that 
the rise time of the step function is smaller than one magnitude of the transient 
time of the system to be tested. 



Figure 4.46 Testing with a non-ideal step function 

W*(f) 
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Of great practical importance, especially for self-adaptive systems, are those 
methods of identification which use the noise at the input of a system (Davies. 
1 970) As shown m Section 4 3 3 4 the weighting function g{t) may be calculated 
by deconioluting the relation 

= f df (4 55c) 

Jo 

Because deconvolution technique becomes difficult, methods in the frequency 
domain or special noise sources are used (Davies, 1970) Use of white noise 
yields 

^,,(t) = constant g(x) (4 55d) 

that IS, the weighting function becomes a constant multiplied by the cross- 
correlation function (Woschni, 1981) For more detail, including errors arising 
from use of a non-ideal correlation function, and the reduction of errors sec 
Davies (1970) 


43.9 Typical Systems 

Table 4 12 gives a survey of the most important measurement systems and 
characteristic functions in both the time and frequency domains (Woschni, 
1981) The results of this tabic, in prinaple at least, may be used to find optimal 
parameters of a system For example, the best damping of the spring-mass 
damping system, typical of a great number of measunng systems, can be read 
off to be approximately 1 (precise value 07) The transfer functions of typical 
systems as illustrated in Table 4 12 allow the user to gain a survey of typical 
curve distortions and their causes using the methods for approximations of 
Section 4 3 9 These considerations lead to the results summarized in Table 4 13 
As input function a pulse-shaped curve is assumed Errors are dealt with in more 
detail m Chapter 6 of this handbook 


43.10 Some Remarks on Optimization 

Optimization is treated here speafically with respect to system theory Both 
dynamic errors and errors caused by disturbances, eg noise, are taken into 
consideration 

The behaviour of measuring systems depends upon a number of parameters 
Aj,eg time constants 7^. which, at least to acertain extent, may be freely selected 
If the parameters of the system cannot be changed, an optimization can be 
effeclcd by a following correction system or by a correction system connected 
m opposition, the parameters of such a system being adjustable With the 
advance of microprocessors, computers of that kind will be employed for this 
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purpose, the behaviour of the computers necessary for the optimum behaviour 
of the entire system being ensured by suitable programming. 

The fundamental idea is shown for one-dimensional problems in Figure 4.47. 
The difference between the output variables of the ideal optimum system (model) 
and the real system, with a following correction system, is formed and assessed 
according to the computing instruction defined by the optimization criterion. 
The parameters /c,- of the correction system are set so that the real system re- 
sembles the optimum system as far as possible. Principally, we distinguish 
between static and dynamic optimization (Bellman, 1961) according to the 
definition of the performance criterion. If the system is dimensioned so that a 
performance function that depends directly on the parameters becomes 
optimum, we have a static optimization. In dynamic optimization, on the other 
hand, a process x{t) is sought such that a performance function depending on 
this process— a functional— assumes an extremal value (Ventcelj, 1964). 

The previous considerations imply that the central problem of optimization 
is the creation of a suitable optimization criterion. 

In most cases the criterion of the mean square error, as shown in Figure 4.48, 
is used. If Greai(j®)GcorXj<jj) IS the frequency response of the system corrected 
by a following correction system Gcorr(j<y) as shown in Figure 4.47, we obtain 
the mean square error (Woschni and Kraus, 1975); 




5'a:X(«)|Gid(jf«) - G,eal(jw)Geo„(j")P d" 


pco 

+ 5zz((y)|Gco„(j(w)pdcy 
•^0 

= + p. 


(4.56a) 


with the assumption having been made that the disturbances occur prior to the 
correction computer. 

The total error according to equation (4.56a) is composed of two components : 
the error due to insufficient dynamic behaviour and the error due to the 
disturbance P^. If there is no correlation between signal source and disturbance 
source, as has been assumed here, the two error components must be added to 
arrive at the total error. When the error components and total error are plotted 
against the correction degree ‘a’, relationships_will always result as presented in 
Figure 4.49. The dynamic error component p^ decreases as ‘a’ increases and 
disappears for ‘a’ ->■ co (ideal correction), whereas the error component due to 
the disturbance increases with ‘a\ Figure 4.50 provides the physical demonstra- 
tive explanation. In this figure the amplitude responses iGreai(j(a)l of the un- 
corrected system as well as of the correction system |Gaorr(j<u)| are plotted. To 
compensate for the decrease of the uncorrected system at frequencies above 
the limiting frequency cUa the correction system must raise these frequencies 



Table 4 12 Survey of typical systems and their characteristic function? 


Mathematical 

formulation Transfer locus Amplitude phase curve 







Function of time 


transfer function weighting function Examples for systems with 
Poles ( X ) and zeros (0) xji = /i(t) aj\ = g(t) such a behaviour 


None 



> COS” Z? 


Double 
jT / Zero 

-/■— 

NLh/T’z 


Cannot be exactly 
represented 


M-j 

Factor ,-/,/? cannot be 
exactly represented. 




systems with proportional 
behaviour (idealised) 



/=o 7; t /=0 / 


tempera ture-measuring 
arrangements without 
protective tube; very 
heavily damped systems 
capable of vibrating with 
proportional behaviour 
(idealized); system with 
delay of the first order 
and compensation 




spring-mass damping 
system, D < 1, 
temperature-measuring 
arrangements with 
protective tube; D < 1, 
approximation behaviour 
for vibration systems 



spring-mass damping 
system without fixed 
point D ^ 1 

system with pure dead 
time (idealization), caused, 
e.g., by pipeline, transport 
path, etc. 

real systems with dead 
time, e.g., temperature- 
measuring arrangement 
with heat conductivity 
feed to transducer 




Tabic 4 13 T>pical vkave-shspe distortions and their causes 






Output \jriable t,(/) 

Input \imblc T,{t) Cauw amplitude cu^^e 

Cur\c shape Typtcal features phase cur\-cIC(j<u)| Typical fciturcs Remarks 



tdeol 



SIGNALS AND SYSTEMS IN THE TIME AND FREQUENCY DOMAINS 


175 



Figure 4.47 Principle of optimization 



Figure 4.48 Mean square error generation 



Figure 4.49 Behaviour of the components of the error as 
a function of the degree of correction 
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Figure 4 50 Physical explanation ofthe increase of the 
noise in case of correction 


correspondingly up to its limiting frequency However, this inevitably 

causes the spectral frequencies of the disturbances that fall into this range to be 
raised so that the error component caused by the disturbance becomes the 
greater the higher the value selected for tt)* Therefore, there will always be a 
minimum for the total errorP = ^ + F.. this minimum being deeper and more 
pronounced the lower the inherent disturbances of the uncorrected system and 
the better the dynamic behaviour of this system This minimum corresponds to 
the case of an optimum filter according to Schlitt (1960) 

As an example consider the first order system treated already in Sections 433 
and 4 3 4 The frequency response is 

G(J(»)= , (4 56b) 

1 + jcoT 

Let be a white-structure signal applied to the input, let the band be limited to 
(Oj, and let w hite noise be the type of interference F urthermore, no correlation 

IS assumed between the signal and the interference 
The correction programme of a series-connected system that can be realized, 
in tbis case reads 


the factor 


Ck(j<j) = 


I 

1 + jcoTi 


T/Tt = o), J(o^ 


(4 56c) 

(4 56d) 


indicating the improvement of the bandwidth by the correction If a micro 
computer is used for the realization of equation (4 56c) the limit frequency of the 
analog digital-analog conversion, with the sampling time r^, is 
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Thus, the mean square error is 




^ dm + m,, — cOj. , 

, 1 + co^T? 


Jo ' 

r-^ ' 1 + m^T ^ 

Jo 1 + oj^Tl 


(4.56e) 


In order to obtain values that can be compared with the original system and 
enable an estimate of the efficiency of the correction to be generated, we calcu- 
late the error of the system without any correction while assuming a bandwidth 
limited to = l/T: 


■» = 


1 — ^ — — dm + 

I +jmT 


/•a>x \ 

dm + 

JlOc / ‘'0 


foj^ n 


(4.56f) 


To find the more favourable solution in each case, and thus obtain suggestions 
for synthesis, consider the following two cases: (a) the limiting frequency m^.s, 
i.e. the sampling frequency, be adapted to the limiting frequency of the corrected 
system mcj;; and (b) the limiting frequency m^^s be adapted to the limiting 
frequency of the signal m,,: 

(a) me,s = C0c,k- From equation (4.56e), for m^.s == m^.k < oi*, one obtains 


£ = 


, m? 


1 - T + — - 1 + 

4/ cuc.k 


(4.56g) 


and for m^.s = m^. ^ > respectively. 


= S,.„m„ 


— arctan 


^zo rUc, k 


1 __ 


(4.56h) 


(b) ®c.s = 


= S,„m„ 


_ arctan 

to } _r^c,k \^Q,k/ _ 


+ — arctan 


(4.56i) 


Any existing amplification or attenuation in the original system (static trans- 
mission factor) may be considered in the usual manner in the signal to noise 
ratio S^JS.^. In Figure 4.51a,b,c, the results obtained from equations (4.56g,h) 
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(b) 




Figure 4 51 Dependence of relative mean square error on the bandwidth improved by 
correction SJS,„ = 10^ SJS,^ = 10 *. — SJS,^ = 10®, SJS,, = 

10 ®, (a) = 10 , (b) 0 */QJe = 10*, (c) = 10* 


are shown for different values of the signal lo noise ratio of 20, 40, 60, and 80 dB, 
1 e for = 10^, 10*, 10®, and 10®, m relation to the bandwidth increase by 

correction of co^ The parameters of oyjo}^ were selected such that dy- 
namically good systems (Figure 4 51a) as well as dynamically poor systems 
(Figure 4 51c) are involved The values Jor the mean square error are related 

to the error of the uncorrected system according to equation (4 56f ) so as to 
indicate directly the reduction of the deviation 
All the diagrams for the adaptation of the bandwidth co^ ^ to the bandwidth 
of the programme for system correction Wc , (case (a)) reveal a minimum cor- 
responding to the case of an optimum filter while the dynamic portion of error 
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(first term equation (4 56e)) decreases with rising degree of correction, the 
interference-dependent portion increases with rising degree of correction, this 
being due to the increase m the spectral portions a > The efficiency of the 
correction, therefore, will be the higher, the dynamically better the system and 
the smaller the interferences This substantiates the finding obtained by Woschini 
(1969) on the basis of physical considerations 

By adapting the bandwidth ^ I® of *he input signal (case (b)), the 
results shown in Figure 5 52a.b,c are obtained from equation (4 56i) The para- 
meters were selected such that direct comparison with the results represented 
in Figure4 51 is possible Incontrastiocase(a),dependencies are obtained which 
tend asymptotically to a limit value, this is because the interference-dependent 
portion does not rise any more because of band limitation In dynamically, \ery 
poor systems, that is for the case of large values of (Figure 4 52b, c), the 
error increases with correction In these cases the increase m the fraction of 
errors caused by interference predominates because of the increase in the high 
spectral frequencies 

Finally, it should be emphasized that, in practice, further limitations in 
efficiency occur due to the sensitivity of parameters which m this investigation 
hate not been taken into consideration and may arise due to possibly existing 
non-linearities (Woschni, 1967) 

4.4 COMMUNICATION AND INFORMATION THEORY 
4.4.1 Communication Theor) 

The task of communication is the transmission of a message from an information 
source to a receiver as shown in Figure 4 53 The output of this information 
source may be a digital or continuous signal, as treated in Section 4 2 In measure- 
ment it IS the unknown quantity that is to be measured the output is generated 
by a random mechanism having a probabilistic nature Otherwise the signal 



Figure 4 53 Communicalion system 
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would be completely known and there would be no need to obtain the output 
by means of measurement. The signal coming from the information source is 
the input signal of the encoder, modulator or transmitter. If the signal is digital 
the term encoder is used while for analogue signals modulator is used. This 
subsystem is treated in Sections 4.4.4 and 4.4.5. The function is to transform 
the signal, if there are several information sources to multiplex them onto the 
same transmission channel, or to make the signal immune to disturbances. 

The modulated or coded signal is transmitted by the channel or processed 
by the processor. This channel may be a microwave or u.h.f. relay link, a wire 
or cable transmission as is today common in measurement, or a waveguide 
transmission for broad-band signals. In the future lightguide optical transmis- 
sion will gain importance for transmission of measured data because of its 
high immunity to electromagnetic disturbances. The noise is considered to be 
additative. In measurement the signal is often processed by a microprocessor 
containing a memory. As in the case of an analog system the behaviour of this 
system may be described by a transfer function provided the computer program 
is a linear (Woschni, 1981). 

The next link in the serial chain is the decoder or demodulator, the task of 
which is to construct an estimate of the original signal as correctly as possible. 
In the sense of describing the signal as a vector in the signal space (Section 4.2) 
this means that the distance between the output and input signals should be a 
minimum. Many similarities link this problem to the problems of character 
recognition (Finkelstein, 1976). 

4.4.2 Information Theory 

If the number of the different possible symbols that the information source is 
able to deliver is m, and if there is no a priori information about the probabilities 
of the different symbols at the receiver before receiving the signals, all possible 
symbols at the receiver have the same probability: 

Pi = P 2 = • • • = A- = 1/hi (4.57a) 

with the normalization 

m 

Zp.= 1 (4.57b) 

1= I 

The amount of information in message i is defined as 

I = log,o(l/P.-) (4-58a) 

The base of the logarithm is often chosen as 2 (binary logarithm) because in 
many practical systems two stable states are used. The measure of the amount 
of information is therefore given by the information necessary to decide be- 
tween two possible states and is called a bit (from ‘binary digit’). 
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The average information content over all source symbols is given by the 
source entropy H where 

H = - Xp.IogjoP. (4 58b) 

1=1 


H IS a maximum when all symbols are equally likely Then the maximum 
possible value Hq is given by 

Ho = logj m = lb m bit/symbol (4 58c) 

(as In = loge, then lb = logj, and Id = log,o) For the two-symbol (binary) 
source, Hq is equal to 1 (maximum uncertainly or average information per 
symbol) This average information content per symbol, the entropy H, gives the 
number of binary decisions which are necessary on the average to distinguish 
one state out of the ensemble of all possible states 
The above definitions are given for discrete sources For analog signals a 
differential entropy of a continuous distribution is defined (Goldman, 1953) 

HM = - J ”iv(x) lb[n(x)] dx (4 58d) 


Entropy of a continuous distribution is a maximum (optimal coding Shannon, 
1948) for systems with amplitude limitation if p(x) = constant and for systems 
with power limitation if p{x) is a Gaussian distribution (Shannon, 1948, 
Femstcin, 1958, Woschni, 1973) As a measure of the difference between the 
maximum value Hq and the real value H the redundancy AH, given by 


AH = Hq ~ H bit/valuc 
or the relative redundancy A/i is used, where 


A/,=^ = '^=1-A 

Ho Ho Ho 


(4 59a) 


(459b) 


A binary source, for instance, has entropy H(p) shown m Figure 4 54 
Generalizing the entropy for more than one variable, Xi, X 2 , 
results in entropies of higher order This plays an important role in analysing 
digital signals where correlation exists between several bits m the signal sequence 
It IS also of value in relating signals at the input and output of systems (problems 
of signal transmission) Let the joint probability be 

p(x„x„ ,x„ ,x.) (4 601) 

and the conditional probability be 

.X.) (4 60b) 


KXilXj.Xj, ,x„ 
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Figure 4.54 Entropy and redundancy of a binary 
source 

For the entropies it follows that the joint entropy is given by 

X lb[p(Xi,X 2 ,...,A„)] (4-60c) 

and the conditional entropy is 

H(Xi1X2,X3,...,xJ= -££••• p(Xi,X2,....X„) 

xlbl>(xilx2,...,x„)] (4-60d) 

In Table 4.14 the equations of the entropy are placed ^°Sether 

4.55 when expressed in ierms of the ,t‘ ,rans- 

channel with Gaussian signals With power Pj p-ann 1Q61- Woschni 

information follows (Shannon, 1948; Goldman, 1953; Fano, 1961, Woschni, 
1973). 

H(x; y) = ilb(^l + bit/value (4-60e) 

Examples in the field of measurement are treated in ^ referred to one 

The transinformation H(x; y) states the information content r fejredj^o^o^^^ 

transmitted value (measure: bit/value). If the transient ti 
given as 1,^ = 1/2/c the channel is transmitting 

j ^ = 2 /cH(x; y) 


(4.61a) 



Table 4 14 Entropies for two e\ents x and y 


184 


HANDBOOK OF MEASUREMENT SCIENCE 




185 


A® ™ ™ 



Q (Shannon, 1948), given by 

^ H(x;y)max ^ 2 /cH(x; y)niax (4.61b) 

equation (4.60e, the channi capacity of a channel with white noise is 
This very important •classicar 

The relation shows that it is possible to exctiang & 

bandwidth/,, and vice versa. to errors. 

Section 6.4 explores information theory pp 

4.4.3 Applications to Measurement 

In measurement the same general problem exists as ^ ^ the out- 

link (refer to Figure 4.53). The input is the of subsystems 

put is the measured value. Measuremen sys .^^^^^^j^j^g^^tion between two 
connected in series (see Section 4.3.z;. nnints of interconnection 

systemstheproblemofinterface arises. Ateachofthesepo t 

the condition must be met that the informatton Jd be lost, 

to be the same as that of the first system, ot erwi equal The interface 

That means the channel capacities of the 7 

problem itself is solved by means of coding (see ec i 
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Figure 4 S6 Explanation of the maximum 
number of amplitude steps 


A measure that leads directly to information content is that of the number of 
distinguishable amplitude steps ra. or power steps nip With a given limit for 
the output power ±P^ or deflection of the measuring instrument, the 
maximum number of steps is determined by the mean square error or the 
amplitude error Ax, As Figure 4 56 shows (Woschni, 1972b) 

m, =* 1 + nip s= 1 + PJt^ (4 62a) 

the fact being taken into account for addend 1 that ‘0’ is also a possible measured 
value 

Using the results of information theory some very significant implications 
may be derived concerning storage of measured data m practical applications 
For storing m, equally probable values we require 

5 = logj m, = lb m, (4 62b) 

binary storage locations (bit), because with s binary storage locations a total of 
2* combinations can be represented This number of storage locations is the 
decision content of information theory, s = f/o Since with a given number wip 
of power steps, ni, can be approximately computed as 

( P 

1 + (4 62c) 

The number of storage locations required for storing a measured value may be 
assessed by means of equation (4 62b) Thus, for instance, a measuring instru- 
ment with an amplitudeerror of Ax/X = 1 % necessitates Ho = s = log^ 101 = 
3 32193 logio 101 6 64 bits, that is, seven binary storage locations for storing 

one measured value 

On the other hand, a setting accuracy within, for instance, 10"® error can be 
achieved with a punch tape having 3 32193 logi® 10* es 20 punching possibilities 
per value These intuitive considerations have a direct relation to information 
theory According to equation (4 61a) one obtains, as the information flow 
provided by the measuring instrument m the most favourable conditions, an 
approximation with equation (4 62c) 

/ = Xlb(.+^) 


(4 62d) 
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If the error consists of the noise P„ only, equation (4.62d) becomes the 
relationship of Shannons’ channel capacity C„ introduced as equation (4.61c): 

c. = /elb(l+^j (4.62e) 

Thus, a measuring instrument with a signal-to-noise ratio of 60 dB ( = mp = 
10® = = 10^) and a limiting frequency of 10 kHz yields a channel capacity 

of 200 kbit/s. A human being can consciously process only about 20 bit/s. 
Therefore, the measured values would have to be processed by a computer or 
stored on a magnetic tape {C^ = 200 kbit/s to 10 Mbit/s). In practice, the values 
for the information flow are mostly lower by I to 3 orders of magnitude, since 
the degree to which the measuring instruments can be adapted to the signals is 
far from the optimum ; signals contain a large amount of redundant information. 

4.4.4 Coding Theory 

As treated in Section 4.4.1 the first subsystem of a communication system is the 
encoder, converting the input signal into a series of code words. The task of this 
coding is the adaptation (interfacing) of the information source to the channel 
or processor. 

In communication systems a redundancy-diminishing (optimal) source 
coding, having the purpose of economizing the time for communication, often 
plays an important role. In measurements, security of the message against 
disturbances is the general criterion. Here, therefore, error-detecting or error- 
correcting codes are applied (Peterson, 1962). 

For the representation of codes geometrical descriptions codegraphs, in the 
n-dimensional space, are used (see Section 4.2.6). Coding and decoding theorems 
exist. The decoding theorem deals with the problem of identifying a codeword 
by the receiver. For this purpose the decoder compares the incoming code 
words with the words of the code alphabet deciding which code word the trans- 
mitter has sent. In the case where the end of a code word is not marked by a 
special symbol, only the end-points of the codegraph may be filled with a code 
word, otherwise a part of a code word would be another code word. The equation 
that guarantees this is 

K 

X M-'" ^ 1 (4.63a) 

k=l 

where M is the number of symbols and 4 is the length of the kth code word. 
In equation (4.63a) the equality sign represents the most advantageous case 
without code redundancy. Otherwise the factor c in the equation 

K 

c = 1 

(.=1 

is a measure of the code redundancy. 


(4.63b) 
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This decoding theorem is of great importance in measurement because codes 
with redundancy are often used The theorem of optimal coding plays a great 
role in redundancy diminishing coding used in communication, giving an 
equation for the optimum length of the source code words 


Ibpjt 

lb Af 




(464) 


where is the probability of the appearance of the Ath code word Some im- 
portant codes, including those applied in measurement, are now considered 
The most simple code is the counting code, mostly seen in decimal counting 
Today this very easily learnable code is displaced in machines by the binary- 
coded decimal notation because of the smaller number of bits As an example, in 
Figure 4 57, the 1 out-oMO code is presented For manual data coding (data 
input) a particular form of the binary code 

= + ■ + +/l.2' + /l„2» = .l,/l,-. 4.^0 (4 65) 

the binary-coded decimal system (BCD code), is used Here digit-by-digit the 
decimal number is converted into the binary code For each digit four bits, a 
so called tetrad, are necessary (Table 4 1 5) 

In binary notation the complement, necessary for subtraction in a computer, 
sometimes leads to a non-existent code word The compliment of 3(=0011), 
for instance, is 1 100 which does not exist in the BCD code (Table 4 15) This 
disadvantage is avoided by use of the notation of the BCD code, also presented 
in Table 4 15 To each decimal number 3 is added This code is usually used for 
data input 

Another group of codes are the reflected codes, arising by counting at first 
forward, that is from 0 to 9, and then backward from 19 to 10 One of this group 
IS the Gray code It is obtained from the binary code as shown m Table 4 16 
The advantage of this code is that any two code words following each other 
always have unit distance between them, that is the two code words differ in one 


96765lt3?10 

olo 0 0 0 0 0 0 \ 

1 ooooooooto 
2 0000000100 
3 DOOOOOiOOO 
ifOOOOOIOOOO 
50000100000 
&oootoooooo 
70010000000 
8 0100000000 
9 11000000000 


Figure 457 loutof-10 
code 
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Table 4.15 BCD code in binary and excess-three notation 



Binary-coded decimal code 

Excess-three notation 



Second tetrad 


Second tetrad 

Decimal code 

First tetrad 

X3X2X1X0 

First tetrad 

VsFzTiFo 

0 

0000 

0000 

0000 

0011 

1 

0000 

0001 

0000 

0100 

2 

0000 

0010 

0000 

0101 

3 

0000 

0011 

0000 

0110 

4 

0000 

0100 

0000 

0111 

5 

0000 

0101 

0000 

1000 

6 

0000 

0110 

0000 

1001 

7 

0000 

0111 

0000 

1010 

8 

0000 

1000 

0000 

1011 

9 

0000 

1001 

0000 

1100 

10 

0001 

0000 

0100 

0011 

11 

0001 

0001 

0100 

0100 

20 

0010 

0000 

0101 

0011 

50 

0101 

0000 

1000 

0011 

51 

0101 

0001 

1000 

0100 

76 

0111 

0110 

1010 

1001 

99 

1001 

1001 

1100 

1100 


digit only. For this reason this code is often used in measurement for encoding 
disks or linear encoding scales (Figure 4.58). The disadvantage of a distance 
greater than 1 between 9 and 10 occurring is avoided by using the improved 
Glixon code (Table 4. 1 7). For special purposes other codes are used, for example, 
the teletype CCITT code No 3 or, for data transmission, the ISO-CCITT code 
No. 5 (Steinbuch and Weber, 1974). 

During data input, transmission or processing procedures errors may arise 
because of such defects as a wrong perforation of a punched card. Error- 
detecting and error-correcting codes having additional code redundancy have 
been designed. The Hamming distance i.e. the minimum distance between 


Table 4.16 Formation of the Gray code 


Decimal number 

0 

1 

2 

3 

4 

5 

6 

7 

Binary number 

000 

001 

010 

oil 

100 

101 

no 

111 

Shifted binary number 

000 

001 

010 

oil 

100 

101 

no 

111 

S 

0000 

00011 

0110 

0101 

1100 

nil 

1010 

1001 

Gray code 

000 

001 

on 

gio 

iio 

lU 

im 

ITO 
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two code words of an alphabet, has to be (Peterson, 1962), for error-detecting 
codes with the degree of errors to be detected. 

<C.n = /d+l (466a) 

and for error-correcting codes if is the degree of errors to be corrected 

= (4 66b) 

If correction to the degree /? < /. only is used, it is possible to detect additional 
errors up to degree 




(4 66c) 


Table 4 17 Ghxon code 


Decimal number 

Glixon code 

0 

0000 

1 

0001 

2 

0011 

3 

0010 

4 

0110 

5 

0111 

6 

0101 

7 

0100 

8 

1100 

9 

1000 
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A ver>' simple but often used method for data inputting, is the addition of parity 
bits (parity check). This additional bit is chosen in such a way that the sum of all 
digits (the so called weight of the code) is either an even or an odd number. 
Table 4. 1 8 presents the BCD code with parity check, detecting all errors with odd 
weight (1,3,...). This code is often used, with e.xcess-three basis, for data input 
as punched cards or punched tapes 

Other and more complicated error-detecting or error-correcting codes are 
the selector code (u--out-of-n code), the Hamming codes produced by feed- 
back shift registers (Hamming, 1950), the recurrent or cyclic codes (Peterson, 
1962). and codes with block protection (Wozencraft and Reilfen. 1961). 

As an e,xample the conversion of the excess-three code to the binary code is 
now considered. With the relationships between .v and ,v shown. Table 4.15 
yields in Boolean algebra form 

-'•‘o = J'o 

-\'i =(.Vo-3’t)T(Vo'Vi) 

-^’2 ~ (Ji ■^2) T (J'o '^2) 4 - (Vq -Vj • y^) 

-Y 3 = 0-2 -.Va) + 0'o-.Vi -yj) 


where 


• is the logical AND operation 
+ is the logical OR operation 

The realization of these logic equations leads to the circuit of Figure 4.59. 


Table 4.18 BCD code with parity check 


First tetrad 

Second tetrad 

Parity check 

Decimal number 


0000 

0000 

1 

0 

1 

0000 

0001 

0 

1 

1 

0000 

0010 

0 

2 

1 

0000 

0011 

1 

3 

3 

0000 

0100 

0 

4 

I 

0000 

0101 

1 

5 

3 

0000 

1001 

1 

9 

3 

0001 

0000 

0 

10 

1 

0001 

0001 

1 

11 

3 

0010 

0000 

0 

20 

1 

0101 

0000 

1 

50 

3 

1001 

1001 

1 

99 

5 
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ORgote 
ANDgote 
— [>- INVEf?7 


Figure 4 59 Circuit for the conversion of the excess three 
code to the binary code 


4.4 J Modulation Theory 

A special form of adaptation of the source to the channel is that of modulation 
Transmission of signals from several sources over a single channel can be 
accomplished using frequency-division or time-division multiplexing systems 
In measurement the problem of transmitting the output signals of many 
sensors or transmitters o\er one line is often solved by means of the time- 
division (time sharing) method, shown in Figure 4 60 with the pulse interval 
and the period Pulse modulation, with interleaving of the different signals, 
IS applied there for parallel-to-serial conversion In general a modulator may be 
interpreted as a controlled system with the carrier signal as one input and the 
modulation signal x(f) as the control input (Figure 4 61) In the following a 
survey is first given of the several kinds of modulation 
In the analog modulation methods one, or several, parameters of the sinusoidal 
oscillation »(f), termed the carrier osallation, with the carrier frequency flo 

u(0 = U sinffiof + <l>)=0 sm 0(f) (4 67a) 

are caused to vary by the modulation signal x(f) If the amplitude 0 is altered 
by the input signal x(f), amplitude modulation (AM) results 

t> = /(x(0) 


(4 67b) 
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(b) 



Figure 4.60 Time-division multiplexing: (a) system; (b) pulse frame (for ten trans- 
mitters) 


Angle modulation is generated by using x(t) to vary the argument of u(t): 

9(t) = /(x(f)) (4.67c) 

As 0 = Qq f + ^) two kinds of angle modulation exist. 

Frequency modulation (FM) occurs when Qq is varied as 

m = /(x(£)) (4.67d) 

Phase modulation (PM) occurs when <p is varied as 

m = mt)) (4.67e) 

Figure 4.62 shows the modulation methods mentioned above for a sinusoidal 
modulation signal with the modulation frequency a>: 

x{t) = sin(a)l) (4.67f) 

Pulse modulation methods use the principle of sampling (see Chapters 5 
and 12). 



Modulation signal 

Figure 4.61 Generalized modulation system 
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(a) 



/ 


Figure 4.63 Pulse modulation methods (tj = 
sampling time): (a) modulation signal x(t); 

(b) carrier pulse sequence; (c) pulse-amplitude 
modulation; (d) pulse-duration or pulse-width 
modulation; (e) pulse-phase or pulse-position 
modulation 

In the usual case a pulse-amplitude modulation signal is first generated, this 
then being converted to a coded pulse sequence using one of the codes of Section 
4.4.4. 

We now deal with some details such as the bandwidth necessary and applica- 
tions to measurement. 

A sinusoidal modulation signal 

x{i) = ^ sin(ft»l -f (p) 
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Figure 464 Pulse-code modulaiion (a) modulation 
Signal, (b) earner puJse sequence, (c) pulse-amplitude 
modulation, (d) pulse-code modulation (binary code) 


operating on a carrier Cq sin(I5o/) yields the amplitude modulated osciilation 

u{t) = Ootl -V msin(<ot + {468a) 

where the modulation depth m = kX The representation of equation (4 68a) 
m the frequency domain is (Woschnt, 1973, Wozencraft and Jacobs, 1965) 

u(t) = OoisinCHoO ± h" cos[(fio + + ^]} (4 68b) 

show’ing that the resultant is a signal with the earner frequency and two side 
frequencies (Figure 4 65a) The composition of the several spectral frequencies 
provides the time function, as shown m Figure 4 65b 
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(a) 



Figure 4.65 (a) Frequency spectrum of 
sinusoidal amplitude modulation; (b) 
indicator representation 


In the general case of an input signal with bandwidth cu j — cUc to be modulated, 
the bandwidth needed by the transmission link is 

b = 2a»c — 1^0 ± Wc (4.68c) 

around the carrier frequency Qq . An amplifier has to have at least this bandwidth 
otherwise distortion of the original form of the modulated signal will arise in the 
later recovered signal (Woschni, 1981). 

In measurement, amplitude modulation results at the output of a bridge 
operating with inductive sensors, as presented in Figure 4.66. To obtain satis- 
factory dynamic behaviour the condition between the limiting frequency of 
the measured input co^ and the carrier frequency CIq needs to be 


Qq ^ 5(u, 


(4.68d) 
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Figure 4 66 Bridge circuit, 
delivering an amplitude modu- 
lation 


If this condition is not met jt will not be possible to correctly demodulate the 
amplitude'modulated oscillation 

For the operation of capacitive sensors having high sensitivity and m certain 
cases for inductive sensors, frequency modulation is used (Figure 4 67) For 
sinusoidal variation of the capacitance of the sensor 

C = Q + AC = Co^l + ^sin(DJt)j (4 69a) 


the variation of the natural frequency = l/,/(LC) is given by (Woschni, 1962) 

n = n. + An s,nm = [l - ^ s.„(a«) + I (|r)’ s.n^(c»,) - ] 

(4 69b) 



Figure467 Circiutforthe 
operation of a capacitive 
sensor to provide 
frequency-modulated out- 
put 
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Because of the non-linear characteristic differential capacitances sensing 
methods are used. The time function yields 


u{t) 


= Uq J sin(cul)] dt 


= Uq sin( “ ~~ cos(a)t) 


(4.69c) 


The corresponding function for sinusoidal phase modulation is given by 

u(t) = Uq sin[fto' + sin(<yf)] (4.69d) 

A comparison between both equations shows that 

AQ/co = A<p (4.69e) 

represents the equivalent phase deviation (modulation index). Therefore the 
relationships contained in Table 4.19, between frequency and phase modulation, 


Table 4.19 Relations between frequency 
and phase modulation 



Frequency 

modulation 

Phase 

modulation 

Frequency 

deviation 

AQ 

AQ = AOo) 

Phase 

Af2 


deviation 

AO = — 

0 ) 

AO 


are valid (Woschni, 1962). The spectrum of a frequency- or phase-modulated 
signal is derived by means of a series expansion of Bessel functions (Woschni, 
1962), leading to a bandwidth necessary for distortion-free transmission given by 


where I < k < 2. 


b — 2[AQ -f ko)] 


(4.69f) 


Modulation - 

Limiter converter demodulation 



Figure 4.68 Principle of demodulation of frequency modulation 
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The demodulation of a frequcnc> -modulated oscillation is realized as demon- 
strated m Figure 4 68 A limiting stage is followed b} a modulation converter 
converting the frequenc> modulation into an amplitude modulation which 
rectified bv means of a diode Some example circuits realizing the demodulation 
are shown in Figure 4 69 O'^oschm, 1962) 
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Chapter 



M.J. MILLER 


Discrete Signals and Frequency 
Spectra 


Editorial introduction 

Rapid advances in data processing methods— both at the conceptual, procedural level of 
theoretical understanding and at the hardware implementation stage— have given measure- 
ment system designers truly great power to implement, for reasonable cost, advanced 
processmg procedures. These advances have occurred predominantly for the discrete 
form of electric signal. 

This chapter outlines the fundamentals required in understanding and making ap- 
propriate use of the digital techniques now rapidly coming into routine use even in instru- 
ments at the bottom end of the price range. It will be seen that practical implementation 
requires application of certain methodologies and that severe errors in interpreting the 
processed data can occur if the methods are not used appropriately. 

It extends some of the material of the previous chapter. As time passes the material will 
become increasingly more important as advanced signal processing finds yet more applica- 
tion and greater use of the digital signal format. An appreciation of this trend is to be 
found in Oppenheim (1978), a text presenting chapters devoted to digital signal processing 
in a range of diverse application areas. An extensive review of the mathematics required 
is available in Rader and McClennan (1979). General textbooks, that include many worked 
examples, are Oppenheim and Schafer (1975) and Rabiner and Gold (1975). 


5.1 INTRODUCTION 

The availability of the microcomputer and other relatively inexpensive digital 
electronic hardware has resulted in the emergence of new approaches to the 
signal processing problems that occur in many measurement systems. Digital 
signal processing is fast replacing conventional analog techniques in spectrum 
analysers and other signal processors used in a wide variety of applications and 
across many disciplines. The availability of instrumentation capaWe of carrying 
out a fast and efficient calculation of the Fourier transform has provided the 
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means for readily displaying signals as either time functions or m dynamically 
updated frequency spectrum form This equipment finds applications in a wide 
range of disciplines such as m analysis of vibrations for geological research, 
mechanical engineering, sonar technology or chemistry The time and frequency 
domain properties of signals are also important to the radio-astronomer m 
characterizing signal sources, to the neuropsychologist for analysing electro- 
encephalograms or to the engineer concerned with processing and synthesis of 
speech 

This chapter examines the principles involved m using digital processing for 
performing operations such as the Fourier transform Such computer-based 
processing implies that the signals under consideration must be in discrete form, 
that is in the form of finite sequences of discrete quantities Throughout this 
chapter it is assumed that the signals are available m discrete form but these 
techniques can be readily applied to continuous or analog waveforms provided 
appropriate analog to-digital (A/D) conversion is used as discussed m Chapter 
12 Furthermore it is assumed that the sequences have discrete values in time 
but continuous real amplitude values, that is as though there were no quantiza- 
tion of the sample amplitudes (Some authors use the term discrete to describe 
such signals and digital to describe sampled and quantized signals) This 
infinite-bit-precision assumption simplifies the study of processing procedures 
and permits a larger general body of theory to be used 

Fourier transform techniques, particularly the discrete Fourier transform 
(DFT) will be the central theme in the description of the processing tools 
available for the discrete signal domain The DFT has been used in practical 
measurement applications for many years but it usually required relatively 
expensive special-purpose computer facilities with appropriate software 
development The emergence m the I960’s of the fast Fourier transform (FFT) 
algorithm for drastically improving the computational speed associated with 
the DFT operation has completely changed the situation The FFT and the 
availability of the microcomputer brought about a very rapid expansion of 
< 3 / i’Ause’ pn^cessfrrg’ CecAiiiq'a^r FFT arrai’ysers' acv acts' 

readily available in instrumentation form at moderate price 

In this chapter the principles of these discrete Fourier transform techniques 
will be discussed, particularly as they are applied m the area of spectrum analysis 
It has now become relatively common for measurement systems to include FFT 
analyser instruments or else an equivalent software package on a mam frame 
computer The nature of the DFT process is such however, that unless special 
care is taken, the resultant frequency spectrum estimates may be considerably 
in error and, even more annoying, calculations using different sets of data 
samples from the same signal or process may yield vastly different results This 
chapter gives particular attention to the mterpretation of the steps involved in 
discrete Fourier transformations and to the practical implications for the user 
Initially the DFT processing relationships are stated without proof and their 
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key properties summarized. Then a descriptive/graphical interpretation is given 
of the process to lay a firm foundation for understanding the theoretical develop- 
ment that follows. 

Throughout it is assumed that the signals to be encountered in practice, 
whether they be radar or sonar echoes, electrical responses from voice or 
biomedical systems or outputs from mechanical transducers, will usually be 
randomly time varying. Use will therefore be made of statistical procedures 
(based on material presented in Chapter 6) especially in dealing with the inter- 
pretation of results and describing techniques for reducing errors in the spectral 
estimation procedures. 

Another of the most notable areas of development in discrete system theory 
and techniques has been in the field of digital filters. Whilst reference is made in 
this chapter to some elementary ideas about digital filters, it is left to Chapter 10 
to deal more specifically with that topic. 

A great deal of the theory associated with discrete-time signals and linear 
systems will be familiar to the person who is well versed in traditional analog 
theory of linear systems ; as presented in Chapter 4. The description of signals 
in the time and frequency domains and the use of the important connections 
between multiplication and convolution operations will be assumed well 
understood in what follows. Background is available in Chapter 4. 

5.2 DISCRETE TIME SEQUENCES 

A discrete signal is a sequence of numbers or sample values spaced usually at 
uniform intervals of time as illustrated by the sample sequence x(k7^) at the 
A/D converter output in Figure 5.1, where x(k'^) is the sequence of sample 
values x(0), x(7^), x(27^), . . . and 7^ is the sampling time interval. In practice 
most A/D converter outputs are in the form of sequences of multiple-bit binary 
words, one word per signal sample and each word appearing as a set of I’s and 
O’s on parallel output lines. The sequence x(/c7^) as shown in Figure 5.1 is 
intended, therefore, only to represent a set of discrete sequential signal values 
and not necessarily an actual waveform. Furthermore, as previously mentioned, 
the amplitudes of the sequence values are assumed continuous, whereas any 
practical converter will have a finite number of quantization levels. Discrete 
sequences of interest here are not necessarily restricted to sampled values of 
analog signals. A sequence x(/c7^) may be a set of numbers . . . x(0), x(7^). 




* > 


\ _ X(/) 

A/D 

xV(T,) 

llll,. 

V/ / 

Converter 




Figure 5.1 Continuous and discrete signals 
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x(2Tf), corresponding, say, to \ehicle speed data or sunspot observations 
being processed in a computer 

Discrete tune signals are defined only at discrete values of the independent 
\anable, tune, that is at t = kT^, where k is an integer and T^ is the mterval 
betVr een samples For example 7^ may be an iater> al of 24 hours if the sequence 
x(kT^ represents daily sunspot readings 

Several notational forms may be used to descnbe such a sequence including 
the following 

x(kTt) or {t(A,7^)] which imply uniform spacing and or x(k) or 
{x(/c)} which maj apply to uniform or non uniform spacmg. 

Uniform spacing will be assumed in what follows and smce the sampimg 
mterval (^) appears as a constant multiplier factor, it is often convenient to 
assume it umtj for many calculation purposes 


S3 THE DISCRETE FOURIER TRAiNSFORM SUMMARIZED 


For a tune sequence x(t7J consisting of N samples uniform]) spaced 7^ seconds 
apart, the discrete-tirae to discrete frequency Founer transform pair most 
commonl) used is given by 




n s= 0, 1, 

A * I/T. 


A. = 0, 1, 


.N-\ 


,N - I 


(51a) 

(51b) 


Figure 52 illustrates an application of this transform in displaymg the frequency 
components of a signal on a t)pical FFT analyser 
It IS left to a later stage to show how the DFT equations (5 1) are developed 
for discrete signals At this point led us summarize their unportant features 


(a) The DFT equations (5 1) are of similar form to the well known Founer 
transforms for analog signals 


X(f) = 

[ ;t(0e-'="'dt 

(5^) 

1(0 = 


(52b) 


Note, however, that the exponential terms m (5 1) do not contam an 
expbcit/, or 7^ term As will be explamed later, the n and k parameters have 
the connotations of frequency and tune respectively 
(b) The DFT transforms an N pomt discrete-time sequence 

x(_kT,) = x(0), x(rj, X(2U , x((N - 1)T,) 
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Figure 5.2 Typical FFT analyzer (by permission of Briiel and Kjaer) 


into an Af-point discrete-frequency domain sequence 



= 2r(0), x(^), X 


X 


\ N - l)/s 

N 


(c) 


(d) 


An example of a DFT pair is illustrated in Figure 5.3. The DFT of a 
sequence of real time values results in a sequence of complex frequency 
values, commonly represented by separate plots of magnitude |X(n/s/iV)| 
and phase 6x{nfJN). (It is recommended as highly instructive for the 
uninitiated reader to check the values shown in Figure 5.3— a hand 
calculator would suffice.) 

In practice, the signal sequence x(/c7^) is only a segment of the much longer 
sequence which may possibly arise from the particular system generating 
x(kTs). The x(kT^) sample shown in Figure 5.3 can be thought of con- 
viently in terms of the longer x signal sequence multiplied by a truncating 
window function So(0 being Tq = NT^ seconds long and shown dotted in 
Figure 5.3 where 


5o(0 = 


~2^s < f < Tq 
Otherwise 


~T 

Z-'s 


The end points of the truncation window So(f) are conceived to lie at the 
midpoint of adjacent samples. 
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Figure 5 3 Typical discrete signal sample x(kT,) and transform X{nfJN) 

(e) The spacing between the frequency sequence values is inversely propor- 
tional to the sample length (To) since 

r- /» 1 1 

Frequency spacing = — = — = — 

Clearly, the longer the sequence length, the finer the frequency resolution 
in the transformed sequence 

(f) If the infinitely long sequence of which jc(A:T,) is a part, has random com- 
ponents throughout its entire history, then its frequency spectrum (that 
IS, Its Fourier transform) does not exist in any meaningful sense However, 
as will be shown later, the sequence X{nfjN) in the frequency domain can 
be used to estimate a quantity called the power spectral density function 
(to be defined), which does exist even for purely random sequences 


5 4 GRAPHICAL DEVELOPMENT OF THE DFT 


5.4.1 General Comment 

The discrete Founer transforms in equation (5 1) are a convenient but by no 
means unique form of DFT pair that could be used To see the significance of 
this let us examine the development of the discrete transform expressions and 
their user implications At the expense of some small additional complication, 
let us begin by assuming that an analog tune function x(r) is the starting point 
Recall that by no means all discrete sequences derive from sampled versions of 
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Figure 5.4 Illustrative Fourier transform pair 

an analog signal but the approach here, based on Brigham (1974), provides 
important insights. 

Consider the signal x(t) and its Fourier transform in Figure 5.4. The symbol-*->- 
is used to represent the Fourier operation. Actually the modulus | A'(/)| only is 
shown but this will illustrate the principles satisfactorily and in many practical 
cases is the only aspect of the frequency domain function that is of interest. Note 
also that this illustrative example assumes the form of x(t) and X{f) are known 
whereas, in practice, one or the other is usually unknown. 

Consideration is now given to what happens when basic operations are 
performed. 

5.4.2 Sampling: Aliasing Distortion 

Sampling is required, as discussed in Chapters 4 and 12, to convert x(t) to the 
(infinitely long) discrete signal xJlkX)- This is shown in Figure 5.5 where k takes 
integer values from zero to infinity (for causal signals). Note that x^kX) is taken 
to represent here a sampled function of time defined only for the discrete times 
t = kTg. What now is the Fourier transform of this sampled signal? As 
discussed in Chapter 4, we can write the sampled function xJ[kX) as the product 
of the original signal x(t) and a sampling function s(f) the latter being a series of 
delta functions each of unit area. Hence 

x^ikT;) = xmt) = X{t) f 5{t-kT,) (5.3) 

k= — CO 



Figure 5.5 Discrete-time function 
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Figure 5 6 The effect of sampling (a) Sampling function and transformation into fre- 
quency domain, (b) Sampled sequence and transformation into frequency domain 

Furthermore, since multiplication in the time domain implies convolution u 
the frequency domam we can wnte the Fourier transform X,(/) of x,(fcT,) as 

X.(/)=X(/).S(/) (54) 

where the symbol * represents the convolution operation Since sfr) and S{f) 
have the forms shown m Figure 5 6a it is a simple matter to carry out the necessary 
convolution of the original X(f) function with each S(f) delta function and 
then use superposition to obtain X^{f) m Figure 5 6b 
The follownng remarks can be made concerning the sampling process 

(a) sampling x(f) produced a sequence xJ^kT,) which has a periodic frequency 
spectrum such that knowledge of X,if) over the mterval 0 to 1/2% provides 
complete information about A',(/) 

(b) In the vicmity of 1/2% there is evidence of aliasing distortion which results 
from the fact that % could not satisfy the Nyquist cnterion In practice 
this distortion is usually immmized by mcludmg an anti-aliasmg, low-pass, 
filter pnor to the sampler unit 

5.4 J Truncation Window: Leakage Distortion 

Truncation is necessary to reduce the lime sequence to a finite length to allow 
computation of the transform to be undertaken practically. The truncated 
function IS 

x{k%)k^0,i, ,N-1 
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Figure 5.7 The effect of a truncation window; (a) truncation window and its transform; 
(b) truncated sequence and its transform 


as shown in Figure 5.7b. We can write x(/c7J) as the product of the infinite 
sequence and some truncation window function So(t) which is 1 over an 
interval from — to (N — \)T^ and has spectrum S^if). Hence 

x{m = xfkTMt) = xmOsoit) (5.5) 

with Fourier transform 

Xsif) = Xif) * Sif) * So(/) (5.6) 

where the modulus of So(f) is the ubiquitous sine function of /. Note that 
sine a = sin(7ta)/7ta. The subscript N in the symbol X ^(/) is intended to indicate 
that the Fourier transform is based on N samples only of x(fc7^). 

The following remarks can be made about the truncation process: 

(a) Truncation in time results in the second modification or distortion of the 
spectrum, called leakage. The effect of the rectangular time domain 
truncation window (Figure 5.7a) is equivalent in the frequency domain to 
convolving the sine function So(/) of the window with the spectrum 
-^s(/)' The effect is illustrated in Figure 5.7b by rippling distortion in the 
frequency domain. 

(b) If there were sharp transitions in the spectrum Xff), the convolution with 
So(/) will produce smoothing or blurring. For example, if the original time 
signal had contained a sinusoidal component (discrete-frequency com- 
ponent), convolution with the sine function So(f) would have resulted in 
each discrete spectral line (delta function) being replaced by a sine function 
as illustrated in Figure 5.8. 
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Actual spectrum Transform of window Tronsform 


Figure 5 8 Leakage distortion due to a truncation window 

This effect is called leakage because the truncation ‘filtering’ effect gives 
nse to a ‘leakage’ of power values from the original frequency into the 
neighbounng frequency regions Unless counteracted, this could limit the 
DFT spec,tT\im Vo a useCul dynamic range of less than 40 dB '«ivh 

poor selectivity (see e g Thrane, 1979) 

(c) The actual amount of leakage distortion introduced depends on the 
length (To) of the truncation window in comparison to the sampling 
interval (T,) In practice there will be trade-offs between the number of 
samples and the accuracy of the resultant transform We will see in Section 
5 6 how non-rectangular smoothing windows may be used to advantage to 
reduce undesirable effects of truncation 

SAA Frequency Domain Sampling 

We now have the Founer transform of x(kT,), the sampled truncated 
version of x(r) However the continuous (periodic) frequency function Xf,{f) 
is still not m a suitable form to represent computer calculations which can only 
be represented by a finite number of samples Hence we need to replace Xf,{f) 
by a discrete (sampled) version, say X{nfJN) This can be considered as equi- 
valent to the frequency domain sampling operation 

= (5 7) 

where, as illustrated m Figure 59a, S,(/) is a sampling (or discretization) 
function Its time domain equivalent Sj(t) is also shown Reasons for choosmg 
to sample the frequency function at integer multiples oifJN(Hz) will be discussed 
later where it will be shown that there are only N distinct frequency samples 
computable from the N time samples and that this frequency spaang will be 
adequate (in a Nyquist sense) to descnbe the spectrum 
The following remarks can be made about the process of discretization of the 
frequency function 

(a) The above operation is equivalent to the DFT analyser assuming that 
whatever sample segment of x(0 it takes and processes, the rest of x(f) is a 
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s,l/) 5,(/) 



Figure 5.9 Effect of sampling in the frequency domain; (a) frequency sampling 
function Sdf) and its transform 5[(f); (b) sampled spectrum \X(^nfJN)\ and its 

transform x^ikT,) 


periodic repetition of the sample function. To see this, note that the 
multiplication operation of equation (5.7) implies the convolution opera- 
tion in the time domain which we can write as 

Xp(k%) = xikT,) * Sf(t) (5.8) 

If Sf(f) is a series of relatively widely spaced delta functions (true since 
To TJ it is easy to see that Xp(k7^), the result of this convolution, is a 
periodic sampled time function with period (Tq) of the form shown in 
Figure 5.9b, that is 

XpikT^) = x{kT^ + mTo) m an integer (5.9) 

Thus although the x(t) values outside the truncation window interval 0 
to N7^(s) are unspecified, the DFT treats the original time function as 
though it were periodic with period Tq = NT^. Clearly there will, therefore, 
be differences between the spectrum of this fictitiously periodic signal and 
the actual signal being sampled. 

(b) As a special case, if the original continuous signal x{t) were periodic, this 
frequency sampling effect could be quite significant depending on whether 
or not the truncated sample of x{i) contains exactly an integral number of 
periods ofx(t). That is, if the truncation window is ‘fitted’ over the periodic 
function as illustrated in Figure 5.10a then the actual x{t) values are the 
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(b) 


/Ot;) 




Figure 5 10 Effect of position of truncation window for a periodic signal 
(a) truncation window width equal to two periods of x(/). (b) truncation window 
not equal to nT 


same as the fictitiously assumed values so no error is caused m the spectrum 
Consider, however, the situation as is depicted in Figure 5 10b where the 
periodic input is truncated to an interval not equal to a period The DFT 
computes the transform of a periodic function Xp(fcr,) with sharp dis- 
continuities and as expected, these discontmuilies give rise to additional 
(invalid) components in the frequency domain As will be discussed m 
Section 5 6, these effects can be considerably reduced by employing smooth- 
mg wiruiows which were referred to in the above discussion on leakage 
distortion 

The important points of this section are summarized as follows 
Periodic sampling of the time function every T^is) results in a penodic 
frequency transform with period 1/T, Aliasing distortion will occur unless 
the time function is band-hmited (or prefiltcred) to frequencies less than 
1/2T, 

Truncation of the time function to a finite number {N) of sample values 
results m leakage errors m the frequency domain Sharp transitions in the 
frequency spectrum will be smoothed out 
Discrete sequence representation in the frequency domain implies that 
the frequency domain values are the transforms of the ongmal time 
sequence treated as though it were repeated periodically with penod 
NT^(s) This effect, sometimes known as the ‘picket fence’ effect, may cause 
errors m the measurement of periodic components m the spectrum 
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5.5 ANALYTICAL DEVELOPMENT OF THE DFT 


5.5.1 Introductory Remarks and Example 

The above operations, presented graphically, can now be restated using 
anal 3 dical expressions. Consider the sampled infinite-length time function 

x,(kT,) k = -1,0, 1,... 

Its (continuous) Fourier transform X/f) is given by 

W)= f” x#T,)e-^^''^'dt 

J — 00 

and from equation (5.3), reproduced here for convenience, 
x,ikT;) = x(t) f 5it-kT,) 

— CO 

we obtain, on interchanging the order of summation and integration 


Xs(f) = I f “ x(0.5(t - /cTJ df 

— 00 — CO 


Then using the sampling property of the delta function which can be expressed 
for any function g(t) as 



- /cT^) 


dt = gikr;) 


we obtain the Fourier transform of the infinite sequence as 


W)= Z x(lcr,)exp(-j27i//cT,) (5.10) 

As was seen in Section 5.4, X^(f) is a complex continuous periodic function, 
with period 1/7^. 

For finite duration sequences, 

x(0), x(kT,), x(2/crj, . . . , x((JV - l)kT,) 

the Fourier transform Xff(f) based on N samples only, follows from the above 
as 


Xsif) = Y ^(^^s) exp(-j27t/fcrj (5.1 1) 

Jt = 0 

The discrete Fourier transform is the discrete frequency version of Xjv(/) taken 
at frequency points 


0,fJN,2fJN,... 
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Since, as we have seen, Xf.(f) is periodic, N frequency points are sufficient so 
our N-pomt DFT becomes 

Therefore, since /, = l/T,, we obtain the DFT 

^(f) = (512) 

This IS now applied, as an example, to the situation where it is desired to find 
Xs(f) and X{nfJN) for the sample waveform x(kT^ shown previously in 
Figure 5 3, values for which are tabulated for convenience as follows (also 

t; = 1) 


k 012 34567 

x(kT,) 0 -I —1 0 1 1 0 0 


We have, from equation (5 11), 

ix(k)e 

k>0 

= (-!)€■•'**•' + (- 1) e + e"^®*^ + e 

a e _ ^2*/ ^ ^ e J4«/J 

and using 

sin 0 = y (e*® — e“ **) 

we obtain 

Xsif) = -2je-^*‘''/[sin(2jr/) + sin(47r/)] 

(Note that the variable /rather than <o(= 2izf) has been used throughout this 
work for consistency even though expressions such as the above become 
slightly more cumbersome ) The original time sequence and its Fourier trans- 
form are shown m Figure 511 where 

|A"iv(/)l = 2\[sm{2nf) + sin(4n/)][ 

and 

arg[;A-,(/)] ^ j “ 6ii/ when sm(27iO + sm(4ii0 > 0 
!_ 2^ ~ fin/ otherwise 
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Figure 5.11 Example of Fourier transform of a discrete 

sequence 


Note that just as for Fourier transforms of continuous functions, since x(/c7^) is 
a real- valued function, | X f^{f ) ) has even symmetry about / = 0 and arg[Z jv(/)] 
has odd. Reference also to Figure 5.3 in Section 5.3 shows the DFT X{nfJN) of 
the same sequence as in this example for which it can be seen that 

x(^] = = nfJN 

\N ) |o otherwise 

Alternatively, equation (5.12) could have been used as will be discussed in more 
detail in a later section. 


5.5.2 DFT Frequency Resolution 

Some further comment is needed on the reasons for choosing the frequency 
spacing (fJN) in the DFT equation (5.12). The (continuous) Fourier transform 
•^jv(/) could, in principle, be evaluated in discrete form at any finite set of 
frequencies. Consider the evaluation of X^f) at say L points spaced /i,(Hz) 
apart, that is at the frequencies 


o,A,2A,...,(l- DA 

using 

^N{rfL)= Z x(kT^)exp{-j2KifLkT^) (5.13) 

k = 0 

It is now important to ask what is the minimum value of the frequency spacing 
(/l)i that is, what is the best frequency resolution obtainable? The answer is that 
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there is a likeness to the ‘Nyquist theorem’ operating m the frequency domain 
such that the values of the continuous Fourier transform Xfi{f) can be inter- 
polated exactly at all/from a knowledge of values at only the discrete frequencies 
rfi^ if the frequency spacing is small enough In particular must satisfy 

- l)T, 

This can be appreciated fay analogy with the Nyquist theorem in the time domain 
which provides a lower bound on the sampling rate (1/7^) As is well known, 
provided x(t) is band-limited with maximum frequency F(Hz) then x(f) can be 
reconstructed perfectly from sample values if the sample rate satisfies 1/7^ > 2B 
Such a finite length sequence of N samples will have total duration Tq = 
(N - I)r„ as shown m Figure 5 12, where x(r) is shown defined for a symmetrical 
time interval to parallel the symmetrical positive and negative frequency spectra 
in the time-sampling theorem The dual of this is that provided Xf/(/) is the 
transform of a finite duration function of length To, then can be re- 

constructed perfectly from sample values if the frequency samples are spaced 
such that 

1/A < 2ro/2 

or, since A « (N — 1)7^, the upper bound onA is 
^ (W - 1)T, 



\xtn\ 



\xy)\ 

Z.-scvnDles 
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It is usual in calculating DFT’s to choose a frequency spacing 


/l — 


1 

N% 


^ = 1 
N To 


(5.14) 


where Tq is the truncated sample length. This simplifies computation of the 
exponentials involved and represents an appropriate choice since in practice 
N ^ 1. Hence, the DFT denoted X(nfJN) is calculated at frequencies (nfJN) 
for n = 0, 1, . . . , that is, values at frequencies 0,fs/N, 2fJN, 


5.5.3 DFT Calculations 

Consider the computations required for the DFT of the 8 point time sequence 
used in the example given in Section 5.5.1. The DFT was shown previously in 
Figure 5.3. It is instructive to consider the DFT calculations required using 
equation (5.12) which for convenience we write as 

) = Y^(1cT,)IT"^ for « = 0, 1, . . . , - 1 (5.15) 

where the symbol W is used to replace the exponential ‘weighting functiorC. That 
is 

_ g-j2T(/w 

In iong-hand’, the computations required for the DFT example are as follows: 
the term at/ = 0 is (n = 0) 

X(0) = x(0)lF° + x(T,)W° + xi2Z)W^ + • • • + xa%)W° 
the term at/ = //8 is (n = 1) 

■X(m = x(0)W° + x(TM^ + x(2T,)W^ + . . . + x(7T,)IF’ 

the term at/ = 2//8 is (n = 2) 

^(/s/4) = XoW° + x(TJW^ + x(2TJW^ + • . - + x(7T,)W^^ 

finally, the term at / = 7//8 is (n = 7) 

^(7/s/8) = XoW° + xCTjW’ + x(2T,)W^‘^ + . • • + x(7T,)W^^ 

It is helpful to visualize the values of the exponentials (weights) plotted on a 
unit circle in the complex plane, namely 

W° = l,W^ = = e“ 

These are shown in Figure 5.13. As can be seen W° = = ■ • • and 

, and in general 


W‘ = IF''+‘ 


(5.16) 
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Figure 5 13 Exponential weights 
in the complex plane 


This fact can be used m programming a computer to carry out the above 
calculations The FFT program algorithms first published by Cooley and Tukey 
m 1965 gave tremendous impetus to the DFT processing techniques because 
of the drastic reduction brought about in computer processing time required 
As can be seen from the above, in the evaluation of each term for n = 0, 1, , 

(N — 1) certain exponential functions occur repeatedly For example, m the 
above 

W* = e occurs when n = 1, A = 4 
n = 2. fc = 2 
n = 4, fc * 4 

By calculating these repeated functions only once and working on all N 
summations at the same time, the FFT algorithms can give a time reduction 
factor of about (logj N)/N over an unsophisticated DFT calculation As an 
example, for N = = 16,384 points, a time reduction factor of 14/16384 

results, (say, a reduction from an hour to a few seconds) 

The FFT algorithms will be discussed further m a later section Suffice to say 
at this point that the FFT makes it feasible to take many samples of tune 
functions and to compute their Fourier transform almost as they occur The 
advantage of this in applications such as real-time spectrum analysers will be 
examined in the next section 

Before doing so, it is important to clarify the frequency resolution question 
posed earlier as to the minimum frequency spacing between the frequency 
domain values We have seen that the N-point DFT (or its FFT counterpart) 
yields an iV-point frequency sequence with frequency spacing (/;,) equal to the 
inverse of the sample length That is 
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Thefrequency resolution can therefore be improved by the following techniques : 

(a) For a given sampling rate, increase the number of sample points (iV) 
used— at the expense, however, of computing time. 

(b) For a finite length sequence, by artificially adding zeros to the end of the 
data string. For example, the above 8-point DFT considered previously 
could be written as a 16-point data set x^lkT^) as follows 



c 

0 1 

2 3 

4 

5 6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

cTJ 

0 ~1 

-1 0 

1 

1 0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


It is easy to see that both sequences xiUT^) and the augmented set x^kX) have 
the same Fourier transform ‘shape’ since equation (5.11) becomes 

^N{f)= S^(^^s)exp(-j2n//crs) 

k = 0 

= ^.(/)= Zx,(/cT,)exp(-j2;i//cT,) 

k = 0 

with 

W) = n = 0,1,2,..., 15 

This result is illustrated in Figure 5.14 



Figure 5.14 The effect of zero adding on frequency resolution 
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5 6 SPECTRAL ANALYSIS FROM SAMPLES OF SIGNALS 


5 61 Introductorj Remarks 

The de\eIopment of DFT techniques and particularly the application of the 
FFT algorithm in microprocessor-based hardware has led to a new generation 
of test equipment for measurement and displaying of frequency spectra of 
signals Samples of an analog signal can be taken and the DFT calculations 
earned out so rapidly that snapshot displays of the magnitude of the frequency 
spectral distribution can be presented with no noticeable delay— hence the 
term rLal-time spectrum analysers As further time sequence data are taken in 
with \arying spectra (such as speech sounds), so the analyser frequency display 
can be sequentially updated Such techniques find apphcations m most areas 
where signals are being processed such as in \ibration analysis, pattern re- 
cognition, image processing, speech synthesis, medical electronics (analysis of 
EKG or EEG signals) seismic and other graphical measurements, radio astron- 
omy, and in communication systems This section discusses the pnnciples of 
spectral analysis which should be seen as encompassing a variety of different 
measurements each centred, however, around the DFT 
Many signals of interest in real life are random For deterministic signals, the 
frequency spectra can be determined by use of the Fourier series (for penodic 
signals) or the continuous Fourier transform (for analog signals which tend to 
zero for large time) The DFT provides a means for generating the voltage 
spectrum Xif) of such signals For random signals the power spectral density 
Sgif) is used and is approximated by appropriately averaging successive 
calculations of \X{f)\^ Firstly, however definitions are needed in order to 
proceed 

A discrete time random (or stochastic) signal jc(n'^) or can be described by 
a collection of samples or snapshots of the signal as shown m Figure 5 15 
At a particular time, say T, = 3, a random lariable 2 ( 3 , say, is defined as the 
collection of sample value realizations A sequence of random variables 

Xo,X„ ,X,. 

can be visualized, each being defined m terms of the values at times 0 , 1 , , 

The autocorrelation function R^k) can be defined 

= ElX,X„^f] (5 17) 

namely the expectation of the product of two random vanables (R Vs) separated 
in time by A. samples Real times are expressed in terms of n and the number of 
lags m terms of k rather than (nT^) and (ki;) to reduce the number of symbols 
required The scale factor T, can be re-introduced whenever required without 
any complication 
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Figure 5.15 An ensemble for a discrete- 
time random process 


The expectation of the product X„X„-k is the average over the ensemble of 
all possible realizations of X„X„-i: at a particular epoch n, that is, the average 
for all possible time sequences which the system may be thought to be capable 
of producing. In practice, we only ever have available to us one of those realiza- 
tions. Segments taken from further on in the signal sequence x(/c7^) are, of 
course, parts of the same realization. 

We, therefore, limit our attention to those signal sequences which are 
stationary, that is, those for which EIX„X„^;J does not depend on the epoch n, 
but only on the lag k. (This has been implied above by writing Rx(k) as a function 
of k alone.) Then we regard the expectation as a time average, that is, an average 
of the product X„X„-k over all possible epochs n. It is not certain in general, 
even for stationary sequences, that this procedure is legitimate. We call those 
signals, for which the ensemble average is given by the time average, ergodic. 

Provided the random signal is stationary, the power spectral density S^(f) 
can be defined as the Fourier transform of R^ik), which may be written 

5,(/)- f i?,(/c)exp(-j27t/kr,) (5.18) 

—<30 

Since R^ik) has the dimensions of power, it can be seen, by analogy with the 
DFT of x(/c 7^) mentioned earlier, that values of Sx(f) measure the contributions 
to signal power at each frequency/ 

The time average for an infinite sequence is estimated from a finite segment of 
the sequence by 

I N-lk) 

11=1 
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Figure 5 16 Averaging filter with discrete random input 


and this is also the estimator for Rj,{kX the ensemble average, provided the signal 
sequence is ergodic The divisor is N rather than N — \k\ for reasons to be 
discussed later (Note that ^,(A:)js symmetric in it if the sequence is stationary) 
This IS now applied to an example Consider a zero mean random signal 
X(«7i) or X„ being the input to a network which provides an output Y„ such 
that the RV at time (nT,) is given by 

= + (519) 

(This is a simple averaging digital filter and is illustrated m Figure S 16 where 
sample sequences are intended to show that the output can be expected to 
fluctuate less rapidly with time) We wish to describe the autocorrelation 
functions and power spectral densities of the input and output signals 
Since the mput signal X„ is purely random with zero mean, its autocorrelation 
function is zero for all lags except fc = 0 since 

R^ik) = EIX„X„^^ = J = 0 k = ± 1, ±2, 

For the output 

W = 

For lag k = 0 

£,(0) = Eaxi^ + £[U^,] = lElXn 

= cj = al/2 

For lag k = 1 

R^l) = £Llxa = «J/4 

The same result applies for k = — i (as expected since autocorrelation functions 
are always even functions) For other values of lag k, the Ry(k) is zero since the 
Y„ product has no common sample RV’s Figure 5 17a illustrates these 
results 
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Figure 5.17 Filter input and output; (a) autocorrelation functions; 
(b) spectral densities 


To compute the spectral densities 5*(/) and S//) we use equation (5.18) 
Si/)= E R,{k)txp{- 32 nfkT,) = RM = <Tl 

k= — 00 

since only the fc = 0 term is non-zero. SJ^f) is, therefore, a constant (white 
noise). Similarly 

E(/) = E Ry{k)exp{-j2nfkT,) 

k=-l 

= exp(j27r/rj + exp(-j27t/TJ 

= icr^Cl -h cos{coTJ] 

These results are plotted in Figure 5.17b. Clearly the averager has performed a 
filtering function (low-pass) on the incoming signal. 

Spectral measurements of random signals based on finite length sample data 
sequences are complicated by questions such as: 

(a) Is the signal stationary or can any non-stationary trends be removed? 
Otherwise S^if) cannot usually be defined. Signals may have slowly 
varying mean values, for example, which must first be removed. 

(b) Can we estimate S^if) from the one realization available? We shall use 
statistical tests on our estimation procedures to see how well they perform. 

We will examine the second problem first, assuming the signal is stationary with 
with zero mean. Two computational alternatives for estimating the spectral 
density S,(/) are to estimate RJJc) from a finite segment of one realization, and 
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then take the Founer transform of the estimate, or to estimate S,(/) from the 
frequency domain sequence directly 

■fhe second method has a decided advantage over the first where the FFT 
algorithm (see Section 5 7) is available in the digital processor (If necessary 
Rjk) can be estimated by use of the inverse Founer transform ) 


5 6 2 The Penodogram 

However obtained, the estimate of the spectral density from a finite segment of 
a single realization is called the periodogram, denoted Ssif) to indicate that it is 
based on a sample x„ of length N It is defined by 

Ss(f)^ t R^k)Gxp(~j2nnm (5 20) 

It can be shown (see Schwarz and Shaw 1975, p 160) that this is equivalent to 

S^(/) = -^1 (521) 

Sf,(f) may be plotted as a continuous function or for only discrete values of / 
Note that we originally had (equation (59)) 

A'j¥(/)= Z 


so 

s,(/) = ^|X.(/)|' (522) 

The periodogram estimate is found from the N point DFT which is used 
for deterministic or random signals 

Practical calculations based on equation (5 21) unexpectedly give very 
irregular results— the irregularities not diminishing as the length N of the sample 
is increased It is important to understand and be able to overcome this problem 
We have already seen how the sampling and truncation procedures give rise to 
modifications to the original true X(f) function The difficulty with the spectral 
estimation procedure springs from the random properties of the signals 
We begin by asking how accurately equation (5 21) estimates the true spectral 
density for random signals One way of answering this question is by examining 
the bias and the variance of the estimate SfXf) to determine whether they 
approach zero as the number of samples N is increased An estimator is said to 
be unbiased (good on the average) if the ensemble average of many N-pomt 
estimates approaches the true value of the parameter (random variable) being 
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Figure 5.18 Frequency spectrum bias 

estimated. The bias at a particular frequency /', in the estimation of the 

true spectrum is therefore given as 

Bsif) = S,(/') - ElS,(f')-} (5.23) 

This is illustrated in Figure 5.18. 

Now the estimator R^ik) mentioned earlier is a biased estimator of Rxik), 
since 



(and not R^ik) exactly). Since our estimate of S^if) is obtained from this 
estimate, is also a biased estimator. Specifically, 

ELS, if)-] = ^ i Rx(k)^l - ^ j exp( -}2KfkT,) (5.24) 

An estimator is said to be consistent if its variance approaches zero as the 
number of samples N increases. In the case of the periodogram, we require 

lim EKS^if) - £[5;,(/)])^] 

N->oo 

to vanish. A consistent estimator is one which approaches the true value 
smoothly as N increases, that is, as new information about the sequence is 
received the estimate steadily improves. 

Returning briefly to the autocorrelation estimator Rsik), it is true that its 
variance vanishes for large N, that is 
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Thus It IS meaningful to say 

J N-|*« 

lim — 2 x.x.*|,| = R,(l) 

N-CO “ it=l 

for any realization ati.Xj.Xj, of the signal sequence It is for this reason that 
the biased estimator was chosen, (If the divisor is N — |A.|, the variance docs 
not dimmish (see Jenkins and Watts, 1968, p 179) ) 

But the same does not hold for the estimator 5^(/) The variance of S^(f) 
does not generally dimmish even for large N This has very serious implications 
for interpreting spectral diagrams produced by the DFT since successive 
calculations of S>,(f) using different sample values from the same source can 
give quite different S,(/) plots What is worse, for some random sources, the 
variance or uncertainty can be as large as the true value For example, it can be 
shown (Jenkins and Watts, 1968, p 233) that for white Gaussian noise the 
variance is equal to the expected value Thus we have 

s«(/) = I R,(k)ixp(-,2nfkT,) 
and 

W) = ^I «.(*:)exp(-j2ii/fcT.) 
but while It IS true that 

hm R„(k) « R^ik) 

It IS not true that 

hm S^if) = S.(/) 

Such a result should come as no surpnse when we recall that the and 
S,(/) are each ensemble averages, taken over all the realizations that the system 
might possibly generate, and that we are attempting to estimate each by time 
averages from a single realization of^mre length 
It is for this reason that DFT spectral diagrams may appear quite ‘ragged’ or 
irregular even though, say, the signal being transformed may have a smooth 
spectrum For example, low-pass white Gaussian noise may appear to have a 
spectrum such as shown m Figure 5 19 (Users of conventional analog tuned 
radio frequency (TRF) types of spectrum analysers will be well familiar with the 
care needed for appropnate choice of analyser bandwidth and sweep speed to 
avoid similar vanations when trying to measure the spectral properties of 
random signals) With the DFT estimation of spectra there are two ways of 
reducing the errors (variance) The first, more traditional approach, is by the 
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Figure 5.19 Estimated power spectrum for 
low-pass noise 


use of smoothing windows and the second more recently used method is by 
averaging over several periodograms. 


Smoothing windows 
Rewriting equation (5.24) as 

£[Sw(/)]= E Rx(l<)Vk s^p{~i2nfkT,) (5.25) 

— 00 

where 


1-^ \k\^N 
0 1*1 > N 


(5.26) 


The triangular function plotted in Figure 5.20 is known as a lag window, and 
comes about because only a finite number (iV) of samples is used. (Recall also 
frequency domain aliasing distortion due to finite sample length in calculating 
for deterministic signals.) 

The lag window factor causes Sfi(f) to be a biased estimator of 
Furthermore, equation (5.25) shows that the mean of the periodogram £[S„(/)] 
is the Fourier transform of two time functions multiplied together, namely 
Rx{k) and which we can write in short-hand form 


£[S„(/)] = FlRMvn 

where F[ ] stands for the Fourier transform operation. Using the fact that 
multiplication in the time domain corresponds to convolution in the frequency 
domain we can write 


£[S„(/)]=F[R,(*)]*F[t;n 
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Figure 520 EfTect of lag window (a) (nangular tag 
window function (b) transform of lag window (c) true 
and estimated autocorrelation functions of lag window 


But FlRJ^ky] = S,(/) and letting = F[ot], we can wnte 

Eis,(fy] = sj^n*Vn{D (5 27 ) 

The Founer transform y„{f) of the tnangular lag window is shown m Figure 
520b Figure 5 20c illustrates true and estimated autocorrelation functions, the 
difference between the two waveforms being the bias in the estimate of Rt(F) 
However from equation (5 27) we conclude that the mean of the spectral estunate 
IS the conrolution of the true discrete spectrum and a frequency domain wmdow 
function shown m Figure 520b For large N, this wmdow will have very narrow 
sidelobes and a very sharp peak (tending towards a delta function) so that for 
large N 

and the estimator can be said to be asymptotically unbiased 
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An obvious way to reduce the variance of Sf,(f) is to divide the signal sequence 
of length N into a number (say s) of equal subsequences of length M = N/s. 
Then the variance of the new estimator, the average of the estimators for each 
subsequence, will be less. In the extreme case of a white noise sequence, the 
variance is reduced by a factor s. 

It can be shown (Jenkins and Watts, 1968, p. 241) that this procedure is 
equivalent to multiplying the autocorrelation function by the function 


w(k) = 


1_!^ IkKM 
M 


.0 


|/c| > M 


This is an example of a special lag window called a smoothing window, and is 
identical in form to the lag window mentioned above. (This example is known as 
Bartlett’s window.) As noted above, it is also equivalent to taking the convolution 
of the true spectral density function S^if) with the spectral window 


W{f) = M 


'sin(M7t/)\ 

. Mnf ) 


2 


As M becomes smaller, the variance of the estimator decreases. However, there 
is a corresponding increase in the width of the pass band in the spectral window, 
and the bias of the estimator may be large. Thus we have to sacrifice bias for 
the sake of reducing the variance of the estimate. 

Another approach to reduce the variance of Sjv(/) might be as follows. The 
variability may be due to the variability in R^ik) for |k| close to N, since these 
are obtained by averaging over only N — \k\ points. So to consider only those 
values of J?^(k) for small k (say |k| < M) must surely reduce the variance of 
S;v(fe). 

Evidently this truncation procedure is equivalent to multiplying Rsik) by a 
rectangular window 


w(/c) = 


1 IkKM 
0 ilcl>M 


The effect of this truncation is basically determined by the sine function shape 
of the Fourier transform of this rectangular time window. The sharp discon- 
tinuities in the time rectangle give rise to undesirably high levels of sidelobes in 
its transform (the sine function). Better spectrum analyser performance may 
result if a continuous and smooth time window were used to weight sample time 
functions (or autocorrelation functions). Many window functions have been 
suggested in the literature. The problem is essentially one of selecting a time- 
limited function with minimum energy outside some selected interval. Some 
well known window functions are listed in Table 5.1. 
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Table 5 1 Some well known window functions 


Window function 


Time domain weighting function K(fc) 

reclangulai 


{o 

otherwise 

Hamming 

w(k) = 

Jo 54 + 0 46 cos{2nk/N) 

to 

otherwise 

Hanning 


J05 + 05cos(2jtfc/Ar) 

to 

i(JV - 1) < fc ^ i{N - 1) 

otherwise 


It has been noted that in respect of random signals, window function smooth- 
ing (multiplication m the time lag domain giving the effect of convolution or 
smoothing in the frequency domain) may in some cases reduce the variance at 
the expense of greater bias In general, it is true to say that there is no single 
window function appropriate for all purposes Thrane (1979), for example, says 
that only two are important m spectrum analyser applications, namely the 
rectangular (or fiat) window and the Hanning window The flat window is best 
for transient signals that can be completely contained wjibm the truncation time 
window (DFT analysers are very suitable for the analysis of transients since 
they are designed to work with ffoiie block samples of time data ) On the other 
hand, Hanning weighting is recommended for continuous signals A procedure 
for window smoothing involving three FFT operations is as follows 

(a) calculate Ssif) from the data by FFT, 

(b) find Rsik) by inverse transform using the FFT, 

(c) multiply J?jv(k) values by an appropriate weighting window, and 

(d) use FFT to compute final estimate of S„(f) 

More details are available in many of the references cited at the end of this 
chapter 

Averaging periodograms 

An alternative approach to reducing the variance of S„(/) is to calculate several 
periodograms and to average them together As we have seen, for random 
signals, successive spectra show significant fluctuations or differences due to the 
variance of the estimator Averaging can improve the estimate and also enhance 
the detection of deterministic or penodic signals buried in noise In the latter 
case one might expect the signal-to noise ratio to be enhanced by the square 
root of the number of additions of complete spectra Most DFT-type spectrum 
analysers have provision for selection of a specific number of spectra to be 
averaged or a running average with continuous updating of the spectral display 
This assists m the detection of signals buned m noise or the separation of 


DISCRETE SIGNALS AND FREQUENCY SPECTRA 


233 


periodic and random components. An alternative to ensemble averaging of 
complete spectra would be to average partitions of a spectrum. As an illustration, 
N = 16,384 signal samples, say, could be sectioned into 16 segments. Then a 
1024-point FFT could be performed on each, the results being averaged to 
obtain the final result. Also, overlapping the sequences (e.g. average 32 over- 
lapping 1024-point transforms) can be used to further reduce the variance (see 
Rabiner and Gold, 1975 for details). 


Other considerations 

It was stated above that the use of spectral analysis instrumentation usually 
requires an intelligent operator approach to ensure that meaningful results are 
obtained. In this respect, the newer DFT analysers have much in common with 
their predecessors— the analog swept-tuned analysers for which an appropriate 
choice of such parameters as sweep speed, maximum dispersion and analyzer 
bandwidth are important in order to avoid gross errors. With digital processing 
analysers, it is likewise highly desirable to have some prior knowledge of the 
spectrum under test or at least to carry out several tests on typical records to 
avoid errors due to such factors as aliasing errors (sampling rate), poor resolu- 
tion (sample window width), bias, and variance (smoothing windows or 
averaging). In addition, it was mentioned early in this section that the Fourier 
transform process assumed the signal under test was statistically stationary. For 
example, the signal sample shown in Figure 5.21 illustrates a systematic trend 
(time varying average) which should first be removed. 

Such trends should be removed by least-squares fitting procedures to remove 
straight-line, parabolic or higher-order trends (see Schwartz and Shaw, 1975 
for details). 

In addition, if the signal contains discrete spectral components (sinusoidal 
or other periodic components), these would be expected to result in delta 
function responses in the frequency spectrum. This can be thought of in terms 
of the weighted sums in equation (5.1) adding up to excessively high values for 
some frequency terms. Preprocessing, for example by filtering, may be necessary 
to reduce bias errors once the peaks have become evident from preliminary 



Figure 5.2 1 A non-stationary signal sample 
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Spectral computations More advanced texts, such as Jenkms and Watts (1968) 
provide details The design of digital filters that could be employed are discussed 
m Chapter 10 


5.7 THE FAST FOURIER TRANSFORM ALGORITHM 

Although the principles of discrete Founer transforms have been known and 
used for many jears, the range of applications has been restricted by the rela- 
tively large number of trigonometric calculations The fast Fourier transform 
(FFT) algorithm developed by Cooley and Tukey m the 1960’s for more 
efficiently carrying out the DFT calculation (equation (5 1)) completely altered 
the situation Many manufacturers now produce digital processing spectrum 
analysers capable of measuring transients or of handling frequency spectra up 
to frequencies of the order of 100 kHz and to produce real time updated displays 
of signal spectra as they vary with time Figure 5 2 illustrates a typical analyser 
commonly used in the anaijsis of signals Other applications in the medical 
field for analysis of electrocardiogram waveforms are becoming widespread 
The FFT algorithm is simply an efficient means of calculation of the discrete 
Fourier transform calculation 

X(n/N) = ''j^x(k)\V'^ (528) 

k 0 

where IF = and T, = I for simplicity We have seen that each of the N 

frequency terms X{n) requires the sum of N terms x{k) each multiplied by the 
complex exponential weighting term IF"* To take a trivial case, a 16-point 
transform (N = 16) requires 16 complex multiplications for each frequency 
term and N complex additions, the former being more time-consummg to 
perform The calculation of all 16 frequency terms, therefore, requires this 
process to be repeated N = 16 tunes In total, = 256 multiplications are 
requued for the whole spectral calculation— actually only {N — 1)* of these are 
complex multiplications because for n = 0 or fc = 0 the 1F° term is unity 
Cooley and Tukey considered the decomposition of the transform into 
smaller groups Consider initially the N-pomt x(k) sequence broken up into 
two N/2 sequences A DFT of each of these sequences would require (N/2)^ 
multiplications Since there are two DFTs to be performed there exists a total 
of2(N/2)^ = 128 for iV equal to 16(compare with = 256 previously) Now, 
if it can be shown that these two semi-spectra can be appropriately combined 
by a simple computation procedure, then a saving in the number of multiplica- 
tions has been achieved The concept can be carried further If N is a power of 2 
then what happens if the x(k) sequence is broken up into four equal groups'^ 
Each subtransform then would require (N/4)* calculations, a total of 
4(N/4)* = 64 multiplications for the whole spectrum This idea of subdividing 
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Figure 5.22 Stages involved in a 16-point DFT 


can be continued down to sets of transforms of pairs of input data. The total 
number of times the N points can be split into halves is log 2 N. Figure 5.22 
illustrates the operations involved for N = 16. 

This appears excellent in principle but the key question to ask is how can 
individual DFT’s of separate sets of input data be combined to obtain the correct 
frequency transform values. After all, it must be recalled that each output 
frequency value is a weighted sum of all input values. For example the value 
X(n = 1) is given by 

N-l 

X(1/JV)= 

*=o 

and contains contributions from all N of the x{k) terms. Consider the sequence 
x(k) broken up into halves, one half containing even numbered samples and the 
other odd. The DFT then consists of two parts as follows 

N-l 

X(n)= J^xCkW"*^ 

k = 0 

N-2 N-l 

= X x(/c)lF"'‘ + ^ x(/c)lF"'= 

k = 0,2,... fc=1.3,.,. 

(fceven) (kodd) 

N/2-1 NI2-1 

= X x(2k)W^"’‘ X ^(2^ + 

A. = 0 1 = 0 

allt alU 

N/2-1 N/2-1 

= ^ xi2k)W^’''‘ -1- IF" ^ x(2k + 

k=0 k=0 

' — ' ' , ' 

(iV/2)-point DFT of (W/2)-point DFT of odd 

even x(k) points x(k) points 
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Therefore, we have the combining rule for finding X(n) m terms of two (N/2)- 
point transforms ATifn) and X 2 (n), say, as follows 

^(fi) = X^(n) + WX2{n) n = 0, 1, ,lN - I (5 29) 

Note that since A’lfn) and A'jfn) have only N/2 values each, equation (5 29) can 
only provide Xfn) values for half the N-range, that is for values 0 < n ^ 2 ^ - 1 
For higher n-\alues (JN ^ n < W — 1), we find the combining rule is 

XOO = Xi(« - iN) + ir-YjCn - JW) n = iN, jN+1, ,N- 1 (530) 

This follows from the periodic nature of the DFT.an N-point DFT is repetitive 
after N points and so also an (N/2)-point DFT is repetitive after N/2 points, 1 e 

Xi(n + ^N) = A',(n) « = 0,1,1, AN -I 
Xiin) = Xiin-^N) n = JN. JN + 1, ,N-l 
and likewise for A' 2 (n) 

Rewriting equation (5 29), replacing w by n + yields 

Xin + IN) = A'i(« + iN) + + ^N) n = 0, 1. , JN = 1 

and since 




we have the form 

^{,1 + ^N) = Xi(n) ~ W'Xjin) 11 = 0, 1. , JN - 1 

or, by change of variables (that is, make the range of n run from N/2 to N ~ 1), 
we obtain equation (5 30) 

Summarizing the above derivations 

(a) the x(k) sequence values are grouped into odd and even subsequences, 

(b) these two subsequences are then transformed using (N/2)-point DFTs to 
give A',(n) and X 2 {n), 

(c) the combining rules (equations (5 29) and (5 30)) are used to give Xin) 
For example, for N = 8 

^(0) = Xi(0) + iV^X^iO) and ^(4) = ATifO) - W°X 2 i 0 ) 

Xa) = X,il) + W^XtH) and X(5) = X.fl) - W^X 2 ii) 
and so forth 

This IS illustrated in Figure 523 where step (c) can be seen to contain only 
three different complex multiplications involving W^, W^, and 
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Figure 5.23 Final stage of an S-point FFT 
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It is possible to improve the efficiency of step (b) by finding Xi(n) and X 2 {n) 
each in terms of two half sequences. Note that the FFT process is best under- 
stood by logically working backwards from the final result X(n) to the data. 
Let Xi(n) be combined out of two (JV/4)-point sequences Xii(n) and Xi 2 (fi), 
say, using 

Y f 3 = piiW + n = 1 

_ ijV) - WXi 2 (n -iN) n = iN,iN + l,...,iN-l 

and similar for X 2 (n). 

This process is continued (back towards the data) until we are left with an 
array of 2-point transforms (of the data itself) to evaluate. For our 8-point 
transform example the complete FFT is summarized in Figure 5.24. 


X(0) 

X(4} 

XU) 

XU) 

Yd) 

Y(51 

Y(3) 

X{7) 


2 point fronsforms 2+2—4 point combiners 4+4—8 point combiners 



Figure 5.24 8-point FFT 
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This introduction to the principles of the FFT is concluded by noting the 
following points 

(a) For N a power of 2, there will be logi N stages of the FFT (number of 
subdivisions) In each stage there will be, at most, N different complex 
multiplications Hence the FFT uses approximately N logj N complex 
multiplications compared with for an unsophisticated DFT calculation 
(for example, a 100 times reduction for N = 1024) 

(b) In order to obtain the output sequence X(n) in natural order, the input 
sequence has to be shuffled It can be shown that if the order indexes (fe) 
of each of the sample values is expressed in binary form and then the 
binary number is reversed, the result gives the new sequence order required 
For example in the 8-point DFT above, x(l) is in position 4 since 1 m 
binary is 001, and, therefore, 100 in reverse order 

(c) A number of flow graph representations have been used to illustrate the 
above steps in computing the FFT (see eg Rabiner and Gold, 1975 for 
details) 

(d) Computer programs for efficient calculation of the FFT are now commonly 
available as software packages in many computer installations They 
include the necessary data sequencing and efficient methods of computing 
the exponential (IF"‘) factors 

Many references, such as Brigham (1974), Rabiner and Gold (1975), or Bogner 
and Constantinides (1975), provide further details on such matters as the use of 
flow graphs for representing FFTs and variations on the simple FFT scheme 
described above Zoom FFT, in which a specific part of a given spectrum can 
be given increased resolution, is covered in Thrane (1980) 
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Chapter 



D. HOFMANN 


Measurement Errors, Probability and 
Information Theory 


Editorial introduction 

The perfect measurement system does not exist nor does the perfect measurement circum- 
stance, Furthermore the parameter being measured is, in the philosophical limit, an 
inexact entity. Thus all measurement situations are subject to disturbance from many 
error sources. When finally deciding just what the measured data represent it is absolutely 
essential to consider the various sources of error and how they can validly be combined into 
a simple statement. 

Good measurement arises as much from the study of errors of measurement as it does 
from the choice of principle. All measurement situations should be considered as grossly 
in error until proved sufficiently accurate by theoretical and/or practical verification that 
they suit the need. Too often data are accepted without serious question that they may not 
be an adequate mapping for the physical variable into a representational equivalent. This 
chapter addresses the question of error estimation. It shows the sophistication that is 
available to apply when the demand of certainty of knowing how accurate a measurement 
really is, must be high. 


6.1 INTRODUCTION: CLASSIFICATION OF MEASUREMENT 

ERRORS 

6.1.1 General Remarks 

Increasing automation in research, development, production, and consumption 
constantly creates new tasks for application of measurement engineering 
fundamentals. The number of sensors and measurement devices applied to a 
given task is constantly increasing. Their accuracy, application range, dynamics, 
reliability, and production life constantly need improving. The wide application 
of microcomputers and microelectronics has resulted in new possibilities for 
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measurement engineering The effective application of high performance data- 
processing modules with microprocessors requires 

(a) mathematical formulation of measurement processes, 

(b) the generation of algorithms of the measurement strategy, 

(c) impro\ ement of the precision of measurement-error analysts 

Models are preferred (Hofmann, 1979, Doebelin, 1975), for analyzing and 
synthesizing measurement processes 

Models especially mathematical models, are simpler, cheaper, and more 
easily described and \aned, compared to their originals (measurement signals, 
measurement systems, measurement processes) Moreover they are space or 
time transformable, can be optimized within certain limits and they can easily 
be taught and learned 

The following considerations on measurement error, probability, and 
information theory deal with fixed (determined) and variable (stochastic) 
behaviour models of measurement objects (measurement signals, measurement 
systems, measurement processes) They are represented by mathematical 
algorithms The objects and their models consist of elements and relations 
between them Elements are functional groups or numbers The organizational 
form of these elements of an object (model) is its structure The element’s 
behaviour is determined by the states of the elements of an object (model) being 
m interaction with other things and dependent on time 
In any case « is necessary to consider 

(a) that different points of view might supply different models of one and the 
same object, 

(b) that different aims might result m an emphasis on or suppression of dif- 
ferent structures and of the behaviour of one and the same object, 

(c) that several phenomena of the objects might be described more simply or, 
vice versa, m a more complicated manner, always being independent on 
their importance for the measurement-task solution, 

(d) that the model quality should always to be tested practically, with the help 
of the real object, not by an idealized approximation 

Every process type has got its own mathematical model theory (Peschel, 1978) 
This IS caused by the nature of processes Furthermore, model selection is 
influenced by knowledge attained, by certain opinions occurring m con- 
sidering the model, and by calculational and experimental advantages 
The processes and models can be classified according to the following 
properties (Lange, 1978) 

(a) analog and discrete (amplitude-quantized) processes, 

(b) continuous and discontinuous (time quantized) processes, 

(c) determined and undetermined (stochastic) processes, 

(d) decimal and binary (dual, BCD) processes. 
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(e) periodic and non-periodic (aperiodic) processes; 

(f) linear and non-linear processes. 

In the following, measurement error analysis is to be dealt with in detail, 
considering, in particular, those linear and quasilinear measurement systems 
having determined input quantities as well as undetermined disturbing signals 
that modify output quantities. 

Published works on treatment of errors have tended to have been prepared 
to suit an area of application rather than be kept general. Texts, such as Barry 
(1964), Bendat and Piersol (1971), Halstead (1960), Helstrom (1968) and 
Topping (1972), may provide the specific treatment required. 

6.1.2 Classification of Measurement Errors 

Systematic measurement errors (bias) can be caused by use of imperfect measure- 
ment devices, measurement procedures, and standards, by environmental 
influences (influence quantities) existing during measurement, as well as by the 
influence of the measuring person. Systematic errors can be determined and 
eliminated according to a principle and are, in theory, predictable on a single 
measurement basis. 

Random measurement errors (uncertainty) can occur in sensitive measure- 
ments conducted under repetitional conditions. Repetitional conditions exist 
when the measuring person is repeatedly measuring the same measuring 
quantity, using the same measuring device and the same measurement proce- 
dure. The scatter of the measured value is caused by temporally and locally 
non-constant error sources. Random measurement errors cannot be individually 
eliminated because it is not possible to predict the error magnitude at a given 
time. 

Gross measurement errors are caused by mistakes, wrong or careless readings, 
a temporarily defective measuring device or strong disturbing influences from 
outside. The measuring person can avoid gross measurement errors during 
measuring and, therefore, these errors can be considered, and neglected, as 
degenerated ones. 

Additive measurement errors are characterized by their property of being 
additively superimposed upon the measured value. They do not depend on the 
numerical value of the measured quantity. Additive measurement errors, 
therefore, occur as an undesirable zero-point displacement. 

Multiplicative measurement errors are characterized by their property of 
being multiplicatively superimposed upon the measured value. These depend 
on the numerical value of the measured quantity. Multiplicative measurement 
errors, therefore, are based on sensitivity deviations of the measurement system 
from its desired value. 

Absolute measurement errors are defined as the difference between the actual 
measured value and the error-free measured quantity sought. 
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Relatiie measurement errors are defined as absolute measurement errors 
divided by a reference quantity Percent measurement errors are relative 
measurement errors multiplied by 100 

Reduced measurement errors, or error classes, are absolute measurement 
errors related to a measurement range multiplied by 100 
Static measurement errors are characterized as a time function with a constant 
value 

Dynamic measurement errors are characterized as a time function with 
changing values 

In measurement analysis two methodical error sources should be considered 
Frequently the measurement position in the surroundings of the desired 
quantity a (refer to Figure 6 1) is not accessible (for example, the inner tempera- 
ture of a hot workpiece) The accessible quantity b (surface of a hot workpiece) 
IS changed m its state by the measurement process (mounting of the surface 
thermometer) The acquired quantity c is equated to the desired measured 
quantity x. Between a and c, there exists a difference that can often not be 
ascertained 

Both systematic and random measurement errors can occur together, both 
ansmg from the same piece of physical apparatus The physical-technical 
and mathematical investigations earned out separately on them are based on 
technical and arithmetic advantages Random measurement errors are techni- 
cally more easily comprehended Moreover, there exist useful mathematical 



1— ^ ^ t 



•^1 
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Figure 6 1 The influence of the measurement system on measurement 
mformation acquisition and measurement error analysis a, desired 
quantity, b, accessible quantity, c, attained quantity, d, picked-up 
value.e, comprehended value,/,measured value l,sensor,2, processor, 
3, display 
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algorithms for their arithmetic treatment. A practical possibility is that syste- 
matic measurement errors might be neglected without having been considered 
properly. 


6.2 DETERMINISTIC ERROR MODELS 

Deterministic error models are presumed to hold when there is a deterministic 
reason producing a deterministic result. Repeated measurements always result 
in the same measurement values. In the measurement process of this kind only 
systematic measurement errors can exist. 

The model is considered to be justified under the condition that random 
measurement errors are negligible compared with systematic measurement 
errors. The practical, recommended value is represented by the ratio q ^ ro, 
that is, when random errors are one tenth or less of the error budget. 

Random measurement errors are not observed if measurements are ac- 
complished with insensitive measurement devices. Adequate system discrimina- 
tion is essential. 

The absolute measurement error Ax is the difference between the incorrect 
actual value x, and the correct nominal value x^: 

Ax = X, — Xs (6.1) 

If the actual and the nominal values are time dependent, the systematic measure- 
ment error is 


Ax(t) = x,(f) - xs(t) (6.2) 

The incorrect measurement value appears at the measurement-system output. 
This is designated by the index a: 

Axa(0 = Xa,(r) - X3s(0 (6.3) 

In system theory the transfer property of a system is usually designated by its 
transfer function S(p) (often also written Gip)). It is defined as the quotient of the 
Laplace transform of the output quantity to the Laplace transform of the input 
quantity. The error-free transfer function of the measurement system is 

Ss(p) = x^s(p)/x^(p) (6-4) 

It is defined by the measurement task. The actual transfer function of the 
measurement system is 

Slip) = jCai(p)A(p) (6.5) 

The measurement-error transfer function 


Spip) — 


Slip) 

Ssip) 


x^sip) 


( 6 . 6 ) 
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IS typical for systematic errors in measurement processes In the linear working 
range of the measurement system it does not depend on the input quantity 
magnitude 

In the complex frequency range (p-range) the systematic measurement error 
satisfies the equation 


Ax.(p) = 



( 67 ) 


Table 6 1 Laplace transforms of common transfer functions 


No 


m 

1 

1 

P 

1(0 

2 

1 

p~ a 

e" 

3 

1 

pip -a) 


4 

1 

pip + a) 

Nl-e ") 

5 

TTTp 

T 

6 

1 

P(I + Tp) 

1 - 

7 

1 

(1 4 - r,p)(i + T^p) 

h ~ h 


1 



p(l + T,p){l + T^p) 



1 + T^p 



1 + T,p 

Ti 

10 

i + r,p 


Ki + r,p) 

Ti 

11 

1 + T3P 

1 In-T, 

(1 + T-ipXl + T^p) 

T,-T,\ T, T, 1 

12 

l + TiP 


p{i + rip)(i + r^p) 

•t — *2 
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Transfer of the systematic measurement error to the time domain (t-domain) is 
accomplished by the Laplace transform. The transformation rules are 


1 ^(T + j0> 

= F(p)^’’'dp = L-^{F(p)} 


/•CO 


F(p)= mt-p'dt = L{f(_t)} 
Jo 


( 6 . 8 ) 

(6.9) 


Transformations are carried out practically with the use of correspondence 
tables (see Tables 6.1 and 4.8 and also Hofmann, 1979). 


6.3 PROBABILISTIC ERROR MODELS 


6.3.1 General Comment 

Probabilistic error models are based on the practical experience that one 
cause may have several effects. Repeated measurements result in different 
measured values. In measurement processes random errors occur. Statistical 
regularities are the basis of all the measured values seemingly being scattering 
irregularly. 

Statistical error models have to be used as a basis for consideration when 
complicated objects, having various influence quantities, are to be investigated 
without being able to realize determined properties of the object and of the 
influence quantities because of technical, physical, economic, and time reasons. 
The incomplete object is, therefore, described by an incomplete model. Practice 
decides that the models used are convenient to the aims. 

Incomplete models are able to reflect the most important properties of an 
incomplete object with sufficient accuracy. The pertinent properties of random 
(stochastic) objects can be characterized by characteristic functions or by 
characteristic values derived from such functions. Typical characteristic functions 
are: 

probabilities for discrete random variables 

Pi = P{X = Xi) (6.10) 

probability densities for analog random variables 

p{Xi) = P(Xi X < X; + Ax)/Ax (6.11) 

and the distribution function 

F(.Xi) = P(X < Xf) = S Pi = f P(^i) 

UX<Xi J — 00 


( 6 . 12 ) 
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For estimating the performance of distribution laws the characteristic functions 
mentioned above are not always themselves used Instead charactenstic values 
(measures) derived from them are adequate These are called moments There 
are ordinary moments of k(h order 

m, = £[jr*] (613) 

and centra! moments of fcth order 

= (614) 

where E is called the expected lalue or mathematical expectation 
The ordinary first order moment is also called the arithmetic average lalue 

m, = £tJf3 = *=i(ix,j (615) 

Random quantities are said to be centred if the arithmetic average has been 
subtracted from them The central first-order moment is always 

Pi = £[(;( - £[X])3 = = 0 (6 16) 

The central second-order moment (dispersion, vanance, variation) is a measure 
of the scattenng of measured values about the average value 

o^ = Pi = = EZ(X - £[A'])^] (6 17) 

The mean square dev lation (standard deviation, vanation) is 

(618) 

The term variation is not uniformly used It is used for m Srmrnow and 
Dunm-Barkowski (1963), Storm (1967), and Muller (1975), but for n m Lange 
(1978), Renyi (1971), and Gellert (1977) 

The third-order central moment, divided by the third power of the standard 
deviation, characterizes the asymmetry of the distribution function and is 
called ske\^ ness It is 

73 = J's/®’ (6 19) 

With 

Pi = £[(X - mi)^3 and a = - m,)^]) 

The fourth-order central moment related to the fourth power of the standard 
deviation minus three charactenzes the gradation of the distnbution funclioo 
The expression 

74 = Wo*) - 3 = W/'i) - 3 (6 20) 

means excess of the random variable X, under the condition that the terms 
Pi = — ^ni)*] and = E[{X — exist For the normal distnbution 

/3 = 'A = 0 
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If p represents any real number (0 < p < 1), a number Qp with the properties 
F(Qp) = P(X < Qp) < p and P(X ^ Qp) ^ 1 — p is called a quartile of order 
p (or p-quantile) of the random variable. The O-S-quantile is the median. 


6.3.2 Discrete Digital Random Variables 

A discrete (digital) random variable X can represent many discrete enumerable 
values Xi in a finite interval. The random events of a one-dimensional random 
variable can be represented as dots (numbers) on a numerical straight line. 

A discrete (digital) random variable X can be characterized in the following 
ways: 


(a) By all possible, numerous values X;, which can represent it in the interval 

^max '^'min‘ 

(b) By the probability 

Pi = P(X = Xi) = P(Xi) = lim Vi/n i = 1, 2, . . . (6.21) 

00 

with which the special value xi occurs, where rt is the number of realiza- 
tions of Xi and n is the number of all realizations. These are 

0<pi<l; ^ pi = l (6.22) 

(c) By the probability density 

P(x) = S pAx - Xi) 

where p; is the probability that the random variable X will have the value 
Xi. The symbol (5(x) denotes the Dirac function (delta function) having the 
properties 


and 


S(x) 


00 if X = 0 

P if X 7^ 0 




(pix)5(y - x) 


dx = (p(y) 


with (p(x) being any function that is continuous at the point x = y. The 
function (5(x) is represented analytically by (Sweschnikow, 1970) 


dix) = 


2n 


fCO 

ej“* dftj 
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(d) By the distribution function 


I X€x, 


(623) 


which shows the probability of values occurring and being less than or 
equal to x, We have 

P(x, < X « Xj) = Y, A = - f(^i) 

• Z|<X<X2 

HXi)^F(X 2) lfXi<X2 
Iim F(X,) = F(-OO) = 0 hm F(x,) = F(co) = 1 

(e) By the arithmetic mean value (ordinary first-order moment) 

mi = ElX}=x = ^( £ x.r.) = Y x,P(x,)= £ ^>Pf 

” V~1 2 / .-I 2 ,= i 2 


if absolute convergence exists, that is, if 

f lx.lp. < 00 

<« I 

(f) By the variance or dispersion (second order central moment) 


C^ = fi2 = = Kx. - x)Vi - 

(g) By the mean square deviation (standard deviation) 

a = + 70 ' 


(624) 


(6 25) 


(6 26) 


63.3 Continuous Analog Random Variables 

A continuous (analog) random variable can randomly realize innumerable 
values (dots, numbers) of the number straight line within a finite interval An 
analog random process X(x{, t) provides n different realizations for n similar 
procedures (elementary events) 

An analog stochastic process X(x„ t) becomes an analog random variable 
^(x.), such as A'(x,i(l), x, 2 (lX »>^m( 1)) at a fixed time of t = 1, or a random 
time function, the so called realization x,(0, such as x,i(t), at a fixed elementary 
event 

In ergodic processes the time means equal the statistically expected values 
(Beyer eial, 1976, Lange, 1973, Dietnch, 1973) 
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An analog random variable, or a degenerated analog stochastic process, can 
be characterized by several methods: 

(a) By innumerably many values .x,- that can occur in the interval 

(b) By the probability density 

p{Xi) = lim [P(Xi < X < X£ + Ax)]/Ax = lim ?-,/nAx (6.27) 

*-*0 Ax -*0 

(c) By the probability 

Pix; < X < x,+i) = p(x) dx (6.28) 

(d) By the distribution function 

Hxi) = f pix) dx (6.29) 

— QO 


We have 


F(-oo) = 0 F(oo) = l 


and for F(xi < Tf < Xj) 


F(x) = f" Kx) 

•'Xi 


dx 


(e) By the linear average value (ordinary first-order moment) 


/•+ OO 


X = nti = 


xp(x) dx 




(6.30) 


1 

= hm — x(t) dt 
t=ii r->oo J-T 

(f) By the mean square value (ordinary second-order moment) 

00 1 ^ 

x^ = m 2 = x^p(x)dx= lim ™ x^(0df (6.31) 

J — CO r -* CO J — 7- 

(g) By the variance of dispersion (second-order central moment) 

= (x- x)V(^) dx = lim i f [x(f) - dt (6.32) 

J — 03 •' — 7* 

We have : 

= x^ — x^ 

(h) By the standard deviation 

0- = 


(6.33) 
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The properties of stochastic, stationary, ergodic random processes can further 
be described as follows The autocorrelation function is 

1 /*+’■ 

R„(t) = lim — x(l)x(( - t) dl = x{()x(l - T) (6 34) 

r-oo J-T 

We have 

K„(0) = j? 

= R„(^) — Rjurfoo) if 3c 0 

The Fourier transform is also applicable to the autocorrelation function, and as 
another characteristic function of random processes it results m the spectral 
pow er density gi\ en by 

Fx.Ow)= f '“”dt (635) 

J - B 

The autocorrelation function and the spectral power density function are Fourier 
transforms according to the Wiener-Chinchine formula 

P.M= f \.(T)e '“■dr = F{R„(t)} (636) 

= ”p„(j“>)e'"'<la) = f-*(P„(jtu)} (637) 

The graphical representation of the spectral density function is called the 
po\\er density specinim 

Randomly scattered measured values usually occur between (at the best 
case) the normal distribution function, having the probability density (Beyer 
er o/, 1976) given by, 

= wlh- (“S) 

and the distribution function 


and (the most unfavourable case) the uniform distribution or rectangular 
distribution with the probability density 
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and the distribution function 

fO if X ^ x„,-„ 

~ I (^i ^min)/(^max -^min) ^min X; ^ ^max (6-41) 
11 if X > Xn,„ 

The true distributions cannot be exactly known ; they are only estimated from the 
real, practical, measurement values. Statistical characteristics calculated by the 
formulae given are only estimations. The normal distribution function is usually 
assumed to apply ,unless circumstances arise which justify a closer study of the 
particular situation. 


6.3.4 Characteristics of Random Measurement Errors 

The following considerations are concerned with a situation in which the 
measured value Xa,, which is only influenced by systematic measurement 
errors, is made uncertain by additional stochastic disturbing influences. The 
deviation of each single measurement (of every event, of every value) Xai; from 
the ‘true’ measured value Xai is given by 

fiai = ^ali - ^al (6-42) 

cannot be calculated. The measured value Xg, is not comprehensible from 
measurement when random disturbances are influencing the measurement 
system. But, using the Gaussian method (method of the smallest error square 
sum) the arithmetic mean value (average value), see equation (6.15), 



is, from probability theory, the best value within the measured values Xai,- of a 
measurement series Xau, . . . , Xai,-, . . . ,Xai„. The mean value Xg, approaches 
the true value x^,. 

Instead of the mean value, other approximate values can be used which can 
be ascertained more easily. If the measurement series has n measured values 
arranged after their quantity, the median or central value x^i is, for odd n, the 
measured value in the middle of the arranged series or, for even n, the arithmetic 
mean of both measured values of the arranged measurement series which are in 
the middle. 

In most cases the difference between the median and the arithmetic mean 
value (calculated by equation (6.15)) is negligible. Simplification procedures 
however, are valuable. 

The span-width mean value 


■^M (-^maxa! T Xn,^„aj)/2 


(6.44) 
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generally shows a greater deviation from the arithmetic mean value For 
longer measurement senes it can better be ascertained than the median, because 
only the greatest measured value and the smallest measured value 
have to be established from the ungrouped frequency table which need not be 
rearranged 

The mode D (modal value, density mean value) is the value of the measure 
ment series that occurs most frequently at stated measured values 

The mean values of a measurement series, mentioned above (arithmetic 
mean value, median, span-width mean value, modal value), characterize a 
measurement series of n values by just one value But they cannot be used for 
describing the empiric distribution of the measured values deviation of the 
single measured values from the medium value also has to be given 

The deviation Uji, of each single measurement x.j, from the mean value x,,, 

«^.i. = JC.I. - -x:.! (645) 

can be calculated It is not, however, to be considered to be a representative 
random measurement error as it has a different value for each measurement and 
becomes zero in the sum 

On the other hand the mean square deviation of the single measurement Xgj, 
from the mean value x,(, that is, the standard deviation, 

^^1= “ 0"‘j (646) 

IS more suitable for error information The standard deviation can show limits 
mcludmg the confidence intervals In those intervals near the single measured 
value x,|, or mean value x,i the true value x,| is to be found with a known sta- 
tistical certainty For normally-distributed scattered measured values the 
confidence intervals of the single measurement are 

I’.iE = (647) 

and of the mean value 

The factor = /i(F, n) in equation (647) considers the uncertainty of as- 
certaining the standard deviation, the factor t/y/n = fiiP, n) in equation (6 48) 
considers the uncertainty of ascertaining the mean value The factors a, ki, t/Jn 
can be obtained from tables (Hofmann, 1979, Hultzsch, 1971) Extracts are 
shown in Table 6 2 

The choice of the best value for P depends on the systematic measurement 
error as well as upon technical and economic aspects For production inspection, 
in most cases, P = 95 % and, therefore, a = 1 96 es 2 are sufficient For 



Table 6.2 (a) Factors ki of the confidence limits of the 
standard deviation s^, of the measurement series of ?i single 
measurements for statistical uncertainty P. (b) Factors t/y/ri 
for half the confidence interval isjyjn of the medium value 
Xji of a measurement series with n single measurements and a 
known standard deviation and a chosen statistical certainty 

P. (c) Factor k of the outlier criterion for the elimination of 
gross errors 


(a) 


F = 95% 

P = 99% 

P = 99.73 % 

n 

k. 



6 

2.09 

3.00 

3.96 

8 

1.80 

2.38 

2.93 

10 

1.65 

2.08 

2.48 

12 

1.55 

1.90 

2.21 

15 

1.46 

1.73 

1.96 

20 

1.37 

1.58 

1.75 

25 

1.32 

1.49 

1.62 

30 

1.28 

1.42 

1.53 

40 

1.23 

1.34 

1.43 

50 

1.20 

1.30 

1.37 

(b) 


F = 95% 

P = 997o 

P = 99.73% 

n 

tf-J" 

tt-Jn 

tis/n 

6 

1.05 

1.65 

2.25 

8 

0.838 

1.24 

1.60 

10 

0.715 

1.03 

1.29 

12 

0.635 

0.898 

1.11 

15 

0.555 

0.700 

0.938 

20 

0.468 

0.640 

0.772 

25 

0.412 

0.560 

0.668 

30 

0.374 

0.504 

0.599 

40 

0.320 

0.429 

0.506 

50 

0.284 

0.379 

0.447 

(c) 


F = 95% 

P = 99% 

P = 99.73% 

n 

k 

k 

k 

9 

4.42 

7.10 

11.49 

10 

4.31 

6.99 

10.26 

12 

4.16 

6.38 

8.80 

15 

4.03 

5.88 

7.66 

20 

3.90 

5.41 

6.73 

25 

3.84 

5.14 

6.25 

30 

3.80 

5.00 

5.95 

40 

3.75 

4.82 

5.56 

50 

3.73 

4.70 

5.34 
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precision measurements P = 99%, and therefore a = 258 For a = 3 only 
0 27 % of all measurement errors are out of the tolerance band 

This confidence interval is generally called the practical maximum error of a 
single measurement 

If the normal distribution is assumed, the value = tr appears once in three 
observations, = 2<7 once in twenty-two observations and = 3a once m 
three hundred and seventy observations This means that random measure- 
ment errors are, for the single measurement, given by 

[x*i, - r = ±f»iE = ±akis^, (649) 

and for the measurement series 

- ■*.!]. r = “ ±<s,i/\/n (6 50) 

The square brackets indicate that error information thus indicated are barriers 
(maximum errors) depending on the number n of the used measured values and 
the statistical certainty P 

The square of the standard deviation is called dispenon or variance 

S?| = (6 51) 

In practical in\estigations, for instance, in control-chart technique (Hofmann, 
1979), It IS sufficient to mention the span width (scattering range, variation 
range) 

- ^,..1 (6 52) 

This uses the maximum value of the measurement senes, and the 

minimum value of the measurement series, instead of the standard deviation 
which cannot always be found as easily The variation coefficient denotes the 
relative variation of the single values near the mean value The variation co- 
efficient 

X 100% (6 53) 

is a dimensionless number that is especially suitable for comparing the precision 
(scattenng) of measurement senes of different empirical distributions 

63.5 Gross Measurement Errors 

In addition to random measurement errors there often occur gross measure 
ment errors (outliers is the term adopted here) which cannot always be recog- 
nized immediately Being considered as degenerated measurement errors, when 
identified, they are eliminated from the measurement series Gross measure- 
ment-error recognition is simplified by representing the single measurement 
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values Xaii in a probability graph (Hofmann, 1979). If the measuring person is 
not able to decide whether the measurement error is a gross or a great random 
one, the wild shot or outlier criterion 

l^ag - ^ai! > ks^i (6.54) 

is the basis of the decision at a given number n of measured values and a given 
statistical certainty P. On this occasion, however, it is presumed that all the 
other measured values, with excepted, are scattered according to a normal 
distribution. This assumption has to be tested. The values k = f(P, n) are to be 
seen in Table 6.2. 

6.3.6 Imperfectly Known Systematic Measurement Errors 

Insufficiently understood systematic measurement errors, as well as the un- 
certainties of calculated systematic measurement errors, must be estimated. 
They are summarized according to the Gaussian law of error propagation for 
random measurement errors. The resulting medium error /aj; has to be assigned 
a double sign and be added to the measurement uncertainty (to the random 
measurement errors). Arriving at satisfactory values demands a highly qualified 
measuring person: estimated values are often disputed. 

6.3.7 Formulation of Measurement Results 

The complete measurement result can be in two forms. The result can be calcu- 
lated from equations (6.1) and (6.49) for a single measured value Xau'- 

^aSE = ^alf - ± (a^l^al + /ax) (6-55) 

and from equations (6.1), (6.43), and (6.50) for the mean value of the measure- 

ment series: 

^aSM = ^al - ± {tS^i/y/u -f /^j) (6.56) 

The difference between x^se and x^sm is found in the different tolerance band- 
width. It is more precise to give XgSM- 

6.3.8 Error Propagation of Systematic Measurement Errors in Indirect 
Measurements 


Linear error-propagation law 

If the desired quantity y is a function of several measured values not depending 
on each other, Xj, X 2 , . . . , x„ (indirect measurement of y), that is 

y = /(xi, X 2 , . . . , x„) 


(6.57) 
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the systematic measurement error of the desired quantity y must be calculated 
from the systematic measurement errorsofthemeasuredquantitiesxi,X 2 , 

according to the linear error-propagation law 

(«58) 


This results from development of the function fin a Taylor senes and omission 
of all terms of second and higher power For relative measurement errors 


y ~ ,ti dx. y 


Ay 


(6 59) 


Procedures of logarithmic differentiation 

If only the relative measurement error Ayfy is to be calculated the logarithmic 
differentiation is a suitable procedure The function 

: = loga V (6 60) 


diR'erentiated gives 


dz = (log« t/v) dv (6 61) 

For natural logarithms equations (6 60) and (6 61) are 


z s In t 


(662) 


and 


dz = d(ln t) = dv/v (6 63) 

For calculating the relative measurement error by applying logarithmic dif 
ferentiation, equation (6 57) is converted into the logarithm format 

In y = In /{x,, X 2 , ,x„) (664) 

Differentiating equation (6 64) yields 


y 


V 


(665) 


If the differentials dx, in equation (6 64) are formally substituted by the syste 
matic measurement errors Ax, of the measured values x., the desired relative 
measurement error is found 


y 

y /=! dx. 


Ax. 


(6 66 ) 


Using equation (6 63), equation (6 66) can be expressed m the form of equation 
(6 59) The partial derivatives 5(In y)/3xj m equation (6 66) are relatively easily 
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to determine if the X; in equation ( 6 . 57 ) are mainly multiplicatively connected 
to each other. 

Example. If the desired function is 

y = 0X^X2 - xj 

with the measured values xj, Xj , X3 , X4 and the constants a, b, c, then 

In y = In a + fc In Xj — c In X2 + In X3 — ln(x3 — X4) 

and the relative measurement error of the function y =/(xi, X2, X3, X4) is 

Ay Axi Ax2 Ax3 Axj AX4 

— = b c 1 1 

y Xj X2 X3 X3 ~ X4 X3 — X4 

Measurement errors of typical measuring circuits and measuring functions 

Measurement-error analysis can be simplified if the absolute and relative 
measurement errors of typical measuring circuits and measuring functions are 
known. They have been grouped according to their characteristic measur- 
ing functions and measuring circuits in Tables 6.3 and 6.4 (Hofmann, 1979 ). 


Table 6.3 Relative measurement errors of typical measurement circuits 


Transfer 

No. Circuit Structure function Relative measurement error* 


1 series 


parallel 


3 (positive 
feedback) 


feedback 

(reversed 

feedback) 


— 4^}— S1S2 



Sj + S2 


S, 


AX) Ax2 

Xi X2 


jS j Axj ^ S 2 Ax2 


Si + S2 Xi Si + S2 X2 

1 Ax, 


■ + 


1 Ax, 


1 - S,S2 1 ~ S,S2 X, (l/SjSj) - 1) X2 

Si 1 Ax, 1 Ax 2 

1 + S,S 2 1 + S,S 2 ^ “ (I/S 1 S 2 ) + 1 


* The measurement errors Ax-Jx, are the relative measurement errors of the blocks i before their 
interconnection. 





Table 6 4 Absolute and relative measurement errors of typical measurement functions 
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COS xAx cot x^x 
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Moreover, in connection with error calculation, it is frequently possible to 
obtain important simplifications to formulae by making approximations for 
small numerical values. The most important of these formulae are shown in 
Table 6.5. 

6.3.9 Error Propagation of Random Measurement Errors in Indirect 
Measurements 

Squared error-propagation law 

If the desired quantity^ is a function of several measured quantities a:i,X 2 , . . ■,x„ 
being independent from each other, that is 

y = f(Xi,X 2 ,...,x„) 

the characteristic values of random measurement errors of the quantity y result 
from the characteristic values of random measurement errors for the measured 


Table 6.5 Approximation formulae for small number values 
(a, p,y,6 1) 


No. 

Equation 

Approximation 

1 

(1 ±a)(l ± /})•■• 

1 ± a ± ••• 

2 

(I ± a)" 

1 ± m 

3 

a±a)o ± [})■■■ 

(1 ±y)(l ±,5)--- 

1 + a + P + + ^ + 

4 

1 

1 + a 

1 + a 

5 

n/( 1 + a) 

1 +- 
“ 2 

6 

^(1 ± m) 

1 + a 

7 

1 

a 

’±2 


n/(1 ± «) 

8 

e* 

1 +« 

9 


1 + a In a 

10 

sin a; tan a 

a 

11 

COS a 

1 

12 

sin(g) + a) 

sin (p ± a cos (p 

13 

^/(ab) fl ~ 

+ b) 
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values X, according to the squared error-propagation law (Gaussian error 
propagation) 


Ay 


-(i,e-)T 


(667) 


If y IS a function of only two measured values Xi and Xj, then the linear law 


dy . 

T — Ax. 

dy 

■f" — ^^2 

dxi 



(6 68 ) 


can be applied (Hultzsch, 1971) 


Standard deuation of indirect measurements 


The expression for the standard deviation of the desired quantity y, follows 
equation (6 67) 




(669) 


Confidence intenals of indirect measurements 

The confidence intervals of the single measurement value (index E) as well as 
of the mean value (index M) of the measurement senes are calculated according 
to 

«)]■'' ( 670 ) 

6 4 INFORMATION-THEORETICAL ERROR MODELS 


6 4 1 General Remarks 

The informational content of an elementary event can be ascertained m two 
different ways The first arises when one is only mterested in knowing the 
existence or non-existence of an event (subject of the classic information theory 
of Shannon) A second situation arises when one additionally is mterested in 
knowing the mtensity, the amplitude, of the event (subject of modern measure- 
ment information theory) 

Withm discrete (digital) processes there are only discrete amplitudes to be 
found 

Continuous (analog) processes show seemingly continuous amplitude 
distnbutions Because of the existence of the measurement error Ax, only a 
Ax, correct identification of the amplitude is possible The amplitude, therefore, 
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should be considered to be quantized. If the measurement error Ax^ is introduced 
with a double sign, the number m of the differentiable amplitude steps of analog 
measuring devices is 

+ 1 (6.71) 

2|AxJ 


where x^axa is the maximum amplitude and x„,i„a the minimum amplitude. 

The following equation can be applied for establishing the number mp„ of 
the differentiable power steps: 




max a -'min a 


(6.72) 


where is the maximum power and the minimum power. 

For digital measuring devices 

m = /zTe + 1 (6.73) 


where/z is the counting frequency and 7^ the response time. 

The additative 1 in equations (6.71) and (6.73) indicates that the lowest 
values Xmina> ^mina Can appear and can be recognized. They are equal to zero 
as a rule. Frequently [(x^axa ~ Xn,i„a)/(2|Axa|)] > 1 showing that the second 
term in equations (6.71) and (6.73) is also negligible. 

As an example if a measurement system has a scale with a measurement range 
of (^maxa ~ ^mina) = 100 ^od a measurement error of Ax^ = + 1 % then m = 
(100/2) +1 = 51 amplitude steps can be differentiated. 


6.4.2 Information Characteristics 

Let m possible measured values be differentiable. Each value can be obtained by 

g = lb m = log 2 m = 3.32 logjo = 3.32 Ig m (6.74) 

in bit binary selection steps where these n possible values are coded by 0 or 1 
signals of the redundancy-free dual code. On the other hand a measured value 
can be represented on an information storage basis with q binary storage 
locations with a structural module (error) of 

m = 2’ (6.75) 

differentiable amplitude steps. For converting decimal numbers m into binary 
logarithms, lb m, the following chart can be used; 


m 

lb m 

m 

Ibm 

m 

lb m 

m 

lb m 

1 

0.00000 

4 


7 

2.80734 

10 

3.32193 

2 

1.00000 

5 

2.32193 

8 

3.00000 

mm 

6.64386 

3 

1.58496 

6 

2.58496 

9 

3.16993 

■1 

9.96579 
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This means, say, for characterizing m = 10 values 4 bits are needed, that is, a 
four-6gure binary number For w = 1000 values 10 bits are needed — a ten- 
figure binary number The number q of the selection steps needed for the 
representation of the decimal number m is called the information quantity oi the 
expense It describes the static properties of the informational system 
In any practical case the useful signal (information) at the measurement- 
system output IS superimposed with a disturbance For the receiver the informa- 
tion to be accepted is not determined in advance The information is only 
given a greater or smaller probability Information processes are stochastic 
ones Their mathematical model is the probability theory 
The information content / -► 0 is very small if an event x,, expected with a 
great probability P, -» 1, occurs The information content / -» co is very great 
if an event x„ expected with the very small probability P, -» 0, occurs This is 
modelled by the function / = log(I/P) If m values of x, appear with the same 
probability in a stochastic process the probability for the appearance of a 
value X, IS given by 

P,(X = X,) = 1/m (676) 

Putting equation (676) into the expression (674) leads to the information 
quantity (codmg expense), m bit units, 

g. = lb(I/P,) = -IbP, (677) 

If m symbols s, (numerals or numeral sequences, letters, words) appear with 
different probability P, per symbol s, they have different coding expenses The 
medium’s codmg expense (medium information quantity) is called information 
entropy and is given by equations (6 24) and (6 77) for discrete amplitudes 

H = £[g] = g ^ P,g, = — ^ P, lb P, bits/symbol (678) 

>»i • I 

The information entropy H is a measure having two meanings (Lange, 1978) 

(a) for all symbols of the alphabet, it characterizes the medium codmg expense 
per symbol in bits per symbol, 

(b) for one symbol of the alphabet, it characterizes the medium’s information 
content per symbol in bits per symbol 

Working with continuous (analog) signals we obtain, for discrete amplitude 
steps Ax -♦ 0, the probabibty density p, for the appearance of an amplitude value 
in the range x, to x, -f Ax The probability is given by 

P, = p.Ax (679) 

The medium’s coding expense (medium information quantity or information 
entropy)is,forcontinuoussignaIs,giv'en by equations (6 78) and (6 79) 

H = — hm Y, PiAxlb(p,Ax) (6 80) 
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We say 

2 PiAx = 1 
i= — 00 

For the transition to infinitesimally small quantization steps the information 
entropy is 

ff = - p,-Ibp,-dx+ linilb(l/Ax) (6.81) 

•^ — 00 

The second term of equation (6.81) causes it to diverge and becomes infinitely 
great in the borderline case. This contradicts practical experience and the 
contents of equation (6.71). The second term is often not considered (Krauss and 
Woschni, 1975). When calculating information-entropy differences it dis- 
appears. 

A continuous stochastic useful signal x^t) carries a medium information 
quantity or input entropy according to 

/•CO 

H{X^) = - p(:>Ce)lb p(Xe) dx i- lim lb(l/Ax) (6.82) 

•'-00 Ax -*0 

at the input of an error-free measurement system. The information can be 
partly lost because of input disturbances. This part is called equivocation: 

mxjx,,). 

Output disturbances cause an output signal to contain misinformation. 
This information quantity of the message medium is called irrelevance or dis- 
sipation: H(X^j\Xc). 

The medium information quantity actually transferred is called trans- 
information H{Xe; Xjj). 

The medium information quantity at the measuring-system output, i.e. the 
output entropy H(X^i), is a combination of the irrelevance and the trans- 
information. 

If both the useful signal Xe(t) and the disturbing signal rft) follow the Gaussian 
probability density function then 

From equations (6.83) and (6.82) it can be calculated that: 

(a) input entropy 

H(X^) = \\h\2ne x^(t)] + lim (1/Ax) (6.84) 

AJC-.0 

(b) irrelevance (dissipation) 

I AT e) = ilb(27re rl{t) + lim (1/Ax)) 

Ax->0 


(6.85) 
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(c) output entropy 

2Ke Cx^(t) + r^(01} + Iim(l/Ax) (6 86) 

4 r **0 

(d) fransinformation 

A'.O = i lb{l + (687) 

The transinformation is a property of the transferring channel, related to a 
certain source, but it is not only a typical property of the transfernng channel 
For all calculations natural logarithms are preferred It should also be con- 
sidered that the numerical values of the information quantities depend on the 
units used Calculations with binary logarithms (lb) have the unit bit, those 
calculated with natural logarithms (In) the unit nit and calculations with 
decadic logarithms (logjo) the unit dit The following conversion formulae 
can he used (Novickij, 1978) 

I bn = 0 69 ml = 0 30 dit 
1 nit = 1 45 bit =5 043 dit 
1 dit = 2 30 nit = 330 bit 


6 4 3 Information-theoretical Measurement Error Characteristics 

Measures of errors occurring are the linear medium value of the measurement 
error 


Ax. = J Ax.KAxJ dx = I™ ^ J [x.](0 - XtiCO] <i< (6 **) 

and the square medium value of the measurement error 

Axi=f Ax;p(AxJ<1x= lim f MO - x.s(0]* * (6 89) 

where x,j(i) is the enoncous output signal of the measurement system and 
•*^as(0 Ibe error-free output signal of the measurement system 
Normally distributed desired and disturbing signals m a stationary, ergodic, 
stochastic process are the basis of the Bayes model The square medium value 
of the measurement error in equation (6 89) is called the Bayes risk (Krauss and 
Woschm, 1975, Seidler, 1971) 

Furthermore the medium square deviation s,i of the single measurement 
value Xj, from the medium value x^, that is, the standard deviation accordmg to 
equation (6 46), has been introduced as an error measure 
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The standard deviation multiplied by a constant factor, a ^ 1, results in the 
confidence intervals that are indicators of the maximum errors. These maximum 
errors have two principal disadvantages: 

(a) the selection of factor a can be accomplished arbitrarily; 

(b) the longer the measurement series becomes, the greater the probability of a 
growing maximum error. 

On the basis of the normal distribution the following distribution of the maxi- 
mum error is to be expected : 

" Sal on nn average, once in 3 measurements 
2Sai on an average, once in 22 measurements 
“ 3Sai on an average, once in 310 measurements 
.4Sa, on an average, once in 15000 measurements 

This results in different prerequisites to the estimates of measurement processes 
on the basis of short and long measurement series. 

Moreover, having a limited number of measured values in many situations, 
it is often impossible to obtain the confidence interval (maximum error) itself. 
Only an approximation of the confidence interval, its estimated value, is calcul- 
able. When only a small number of measured values are available, it will not be 
possible to make any reasonable statement on the probably occurring maximum 
values. Therefore Novickij (1978) suggested use of the entropy value rj of the 
measurement error. 

For any distribution of the measurement error the information quantity can 
be described by equations (6.71) and (6.74). To simplify the calculation, with the 
help of logarithms, 

q = lnm = ln[(x „,„3 - = ln(x^^^ - J -Ind (6.90) 

where d is the error interval. Furthermore 

q = H(ZJ - H(X,/XJ (6.91) 

The information entropy of the error is estimated under the simplifying sup- 
position that equivocation and irrelevance are identical: 

H(Xe|XJ = H(X„|X,) = - f” p{x)ln p(x) dx (6.92) 

V — CX3 

If the errors are normally distributed, see equation (6.83), then 
In p(x) = — ln[(T.y(27r)] — x^l{la^) 




dx = 


and with 
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then 

H{XJX.i = lnC<rV(2!t =)] 

To formally define the entropy error A, the following relations are used 

H(XJX^ = In d, = In 2A, (6 93) 

d, = 2\ = cxpiH(X,/X„):i (6 94) 

A, = ieKp[ff(3r./A:.0] (695) 

From Novicijj (1978) 

(a) for the normal distribution 

A. = <ry(ne/2) = 2066 

(b) for the equal distribution 

A, = 0^3 = 1 73 

(c) for the triangular distribution 

A, = <TV(3e/2) = 202 
The entropy coefficient is defined as 

k = AJc (696) 

From above 

A. = ka in * (r„„„ - x„,„„)/ 2X(T ^ * In m 

The commonly used confidence interval and the entropy error are similar 
The measurement result is given using 

= ^»ii — Ax, ± A, {6 97) 

6.4.4 Channel Capacity 

The dynamic properties of measurement information systems can be described 
by the information flow or the channel capacity per unit lime 

Ct = ^ lb m bits/s (6 5^) 

where 7^ is the response time The information flow (flow of binary numbers) is 
greater than the symbol flow if the symbols are each formed by several binary 
numbers (0, 1) Let 


Cj = K/ btts/s 
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where V is the symbol flow in symbols/s and I the coding expense (decision 
contents) per symbol in bits/symbol. In compliance with the sampling theorem 

Te = l/(2/e) (6.99) 

gives the relation between the sampling time T^, or response time 7^, and the 
cut-off frequency 

Equation (6.98) can also be stated as 

Cj = 2/, lb m bits/s (6.100) 

For analog measurement systems the channel capacities have to be formed with 
equation (6.71). For digital measurement systems equation (6.73) is used. The 
channel capacity per unit time, Cy, is approximately equal to the maximum 
transinformation flow through a measurement system. It describes the static and 
dynamic behaviour of a measurement system. In doing so the channel capacity 
does not provide any other statements on the static and dynamic behaviour 
of a measurement system other than the differentiable amplitude step number m 
and the response time 7^. 


6.5 EXAMPLE 


6.5.1 Influence of Multiplicative Measurement Errors on the Measurement 
Result 

A thermocouple, a thermostat, a moving-coil galvanometer, and a flow channel 
with a slide valve (Figure 6.2) are given. A step change of temperature is to be 
measured. The thermocouple has a static measured-value sensitivity of 

Sxh = W^eTh = 0.05 mV/K (6.101) 

The moving-coil instrument has been calibrated with a sensitivity of ^NjCr-Ni = 
^aTh/^eTh = 0.04 mV/K for NiCr-Ni thermocouples and, therefore, it has the 
static sensitivity (Hofmann, 1977) 

Sg = x^g/x^g = ^eTh/^aTh = 25 K/mV (6.102) 

If temperature changes are to be indicated inertia-free and correctly by the 
moving-coil instrument, the following transfer function of the total system with 
thermocouple plus moving-coil instrument applies: 

Ss(p) = 1 K/K (6.103) 

This is the transfer function for an inertia-free (zero-order) proportional 
system with the proportionality factor 1. Thermocouples, however, are slow- 
acting. If the dynamic properties of the total system (thermocouple plus moving 
coil instrument, see Figure 6.2) are determined only by the thermocouple— this 
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Figure 6 2 Progress of systematic and random measurement errors in some temperature 
measurements (a) Components of the measurement circuit, (b) signal flow diagram of the 
measurement circuii, (e) progress of systematic and random measurement errors The 
follow mg table gives values of the measurement senes 1 to 3 


No 

Xjii 

^>12 

.*>13 


^IS 

^»|6 

X,,7 

X,is 

X,|9 

^illO 

1 

22 56 

22 65 

22 35 

22 53 

2Z47 

2259 

2Z40 

22 57 

2245 

2248 

2 

2245 

22.53 

2248 

2349 

2251 

2247 

22 65 

2135 

22 40 

2257 

3 

2001 

20 15 

19 85 

2003 

1997 

2009 

1990 

2007 

1995 

1998 
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is a simplification— the following dynamic measurement-system behaviour 
results from it. 

The spherical measuringjunction of the thermocouple is at a constant temper- 
ature at time t = 0 it is quickly exposed to a medium at a constant 

temperature 0^ > ■ The temperature change inside the sphere is a function of 

time and space given by a sum of exponential functions with different coefficients 
and negative real exponents (Hofmann, 1976). A small sphere radius, a large 
coefficient of thermal conductivity A or a small coefficient of heat transfer a can 
imply a negligible development of all components of order greater than one. 
If this is the case the spherical sensor heats internally as a first-order inertial 
system and the thermocouple generates an output voltage (in mV) given by 


where 


^aTh(0 — -^aTh 



3oct\ 

cpR/ 


(6.104) 


c = specific heat 
p = density 

R = radius of the thermocouple-sphere 
^aTh = output voltage in the stationary state. 

The time constant of the thermoelement is 


cpR/3« = T (6.105) 

For c = 5 X 10"^ kJ/(kg K), p = 1 x 10“^ kg/m^ R = 3 x 10~^ m, and a = 
100 W/(m^K) (stirred air) the time constant is t = 50 s. 

Thus the actual measured-value transfer function of thermocouple is 


^ThiCp) — 


T{^aTh(0} -^aTh/^eTh 


T{x,Th(0} 1 + TP 

with (see Table 6.1, entry nos 1, 6) 


as well as 

and 




^eTh — ~ T — 50 S 

^aV^cTh = 0.05 mV/K 


(6.106) 


(6.107) 


The actual transfer function of the total measurement system consisting of 
thermocouple and moving-coil instrument is described by equations (6.102) 
and (6.106), as well as Figure 6.2: 


S„(P) = SrMScip) = 1.25/(1 -f Tp) K/K 


(6.108) 
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The intemal error transfer function Sf^(p) of the measurement system is, 
according to the equations (6 108) and (6 103), 

Theabsolutesystematic internal erroroflhcmeasurementsystem(themultiplica- 
ti\e measurement error) follows equation (6 109) in the p-range (complex 
frequency range) 

AiaEW = Sre(p)S^) = y K (6110) 

With Xts(p) = 20 K/p because x^(p) = 5s(p)x,s(p) and Ss(p) = 1 K/K as well as 
XfsiP) = 20 K/pforastepchangeoftemperaturefrom 11^ = 50 “CIoIIm = 70X 
with X,s = X.Th 

In the i-range the multiplicative measurement error (using equation (6 110) 
and Table 6 1, entry nos 1, 6) has the behaviour 

Ax.e( 0= 5(1 - 5c "’‘**)K (6111) 


6£^ Influence of Addithe Measurement Errors on the Measurement Result 

An additne measurement error appears when the reference junction is not at the 
prescnbed temperature 0^ = 50 "C but is at 0^ = 52 “C The disturbance 
variable Ax*f = U, - (1* = -2 K produces a counter voltage by the reference 
junction which has the same temperature sensitivity S» *= 005 mV/K as the 
measunng junction, that means Ax.p = S.Ax.p = —0 1 mV The inertia of the 
reference junction has not to be taken mto consideration because the tempera- 
ture of the reference junction has already reached its stationary value at the 
beginning of the measunng process and will remain constant dunng the 
measurement 

The absolute systematic external error is shown in Figure 6 2 for the measure- 
ment chain consistmg of reference junction and moving-coil instrument with 
sensitivity Sq m equation (6 102) 

Ax.f = -0 1 mV X 25 K/mV = -2 5 K (6 112) 


6^3 Influence of Systematic Measurement Errors on the Measurement 
Result 

The absolute systematic measurement error consists additively of the internal 
error (multiplicative measurement error) and the external error (additive 
measurement error), so that equations (6 111) and (6 112) will finally result ui 

Ax.,(t) = Ax.e(i) + A*.,(0 = 2 5(l - I0e-«"')K (6113) 



MEASUREMENT ERRORS, PROBABILITY, AND INFORMATION THEORY 


273 


The step response from 50°C to 70°C at the thermocouple gives, instead of the 
true behaviour of 

^as(0 = 20 K for t > 0 (6.114) 

the real behaviour of the sensor 

Xa.(0 = x,s(t) + Ax„(t) = [20 + 2.5(1 - 10 K (6.115) 

as simulated by the moving-coil instrument. For t > t there exists a stationary 
value of 22.5 K (Figure 6.2) 

6.5.4 Influence of Random Measurement Errors on the Measurement Result 

Random measurement errors originate from such causes as parasitic irregular 
interference fields which can induce stochastic disturbing voltages in the un- 
screened wires of the thermocouple. Therefore, the behaviour of the output 
quantity oscillates randomly about the undisturbed behaviour (after equation 
(6.115)). The measured values (Figure 6.2) were ascertained in the stationary 
state (f p t) for repeated measurements. 

On the basis of measurement series 1 the arithmetic mean value, using equation 
(6.43), is given by 

= 22.5 K (6.116) 

The standard deviation is, from equation (6.46), 

/lO \l/2 

Sa. = ( Z(Xa,.- - x„)V9 1 = 0.09 K (6.117) 

The confidence interval of a single measurement (see equation (6.47)) is, for the 
statistical certainty P = 95 % and n = 10 measured values, 

t^aiE = ±afciSai = 1.96 X 1.65 X 0.09 K = 0.29 K (6.118) 

The confidence interval of a mean value (see equation (6.48)) is, for the stat- 
istical certainty P = 95 % and n = 10 measured values, 

t^aiM = ±5ai?/V« = ±0.715 X 0.09 K = ±0.064 K (6.119) 

The unknown systematic measurement errors/aj are to be zero for the measure- 
ment system in Figure 6.2. 

6.5.5 Formation of the Complete Measurement Result 

The complete measurement result consists of all components calculated up to 
now. The desired value of the output quantities is: 

(a) for a single measured value with equations (6.55), (6.1 13) and (6.1 18), 

XaE = [22.56 - 2.5(1 - 10 e''^^®*) ± 0.29] K (6.120) 
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(b) for the medium value with equations (6 56), (6 1 16) and (6 1 19), 

= [22 5 - 2 5(1 - 10 ± 0 064] K (6 121) 

Concerning the measurement senes m Figure 6 2 it should be said that the 
measured \alue in measurement senes 2 probably represents a gross 
measurement error (outlier) Application oftheoutliercnterion (equation (6 54)) 
says that if (23 49 K - 22 5 K) > 442 x 009 K, then x,i 4 is to be excluded 
Because 0 99 K > 0 4 K this assumption is confirmed Before measurement 
series 3 in Figure 6 2 was taken a correction had been carried out In the present 
case the comparison junction temperature was set at 50 °C, and the scale of the 
moving coil galvanometer with a scaling of 25 K/mV was replaced by a scale 
with a scaling of 20 K/mV This resulted in a 100% correction of the static 
measurement error The correcuon of dynamic measurement errors has been 
discussed in detail in the literature (Hofmann. 1976, Hofmann, 1977) 
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CJ.D.M. VERHAGEN, R.P.W. DUIN, F.A. GERRITSEN, 
F.C.A. GROEN, J.C. JOOSTEN, AND P.W. VERBEEK 


Pattern Recognition 


Editorial introduction 

The simplest form of measurement situation arises when variables that are to be mapped 
into measurement data are easily identified as singular quantities that are amenable to 
sensing by straightforward well-established procedures. Complex system situations often 
involve monitoring a great number of measurements in order to give ultimately a few output 
quantities. 

Another approach is to attempt what the authors of this chapter so aptly term, a many- 
to-one transformation right at the measurement interface with the system. This procedure 
is being developed and reported in what has generally become known as pattern recog- 
nition. Patterns occur in many circumstances. Examples are found in ore-sorting, hand- 
written address recognition, defects in tin-plane stock, abnormalities of forest growth, 
structure of spoken language, artistic creations, social behaviour, economic trends, and 
so on. In cases such as these, the measurement need is often to decide into which of a few 
defined classes certain features of the pattern should be placed. 

This chapter is concerned with the methodology and practice that has been developed. 
Although the subject has not, in general, matured into useful application to the extent 
that was envisaged a decade or so ago it has nevertheless provided workable, economic, 
solutions to many a real-world requirement. The material presented (compiled in 1979) will 
assist assessment of potential situations and provide a valuable springboard into this 
important aspect of many measurement systems. 


7.1 INTRODUCTION 


7.1,1 The Process of Pattern Recognition 

Pattern recognition means here: automatic pattern recognition; pattern 
recognition in humans and animals is not taken into account. The field of pattern 
recognition is very broad notwithstanding this restriction, as both the terms 
pattern and recognition include very different entities or activities (Verhagen, 
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1975a) Pattern recognition systems, however, have some points m common 
they are data or information processing systems, with fairly complex input data 
and fairly simple output data The input data often are collected by means of 
physical measuring instruments from sources in the outside world, but other 
data (for instance, economic, sooographic) may also be used The sources and 
the input data are presumed to contain ‘patterns', mostly together with noise, 
disturbances and background data It is the task of the data processing system 
to assign the input data to a certain class m accordance with the source pattern 
It IS essential m pattern recognition that different input data should be classified 
w the same class It is also true, however, that the same sources and the same 
input data may contain several different types of patterns and so— for different 
problems— ha%e to be classified into different types of classes A few examples 
may clarify these statements 

(a) All different ways of writing the character ‘A’ produce different inputs to, 
for example, a TV scanner, but the output always has to be a class with 
indication A\ one of the classes of the alphabet In a different problem 
however, a certain character ‘A has to be classified as belonging to a 
certain font, or written by a certain penman, and so to quite different types 
of classes to the classes of the fonts or the classes of the individual penmen, 
respectively, instead of to the classes of the characters of the alphabet 

(b) Quite different scenes may all be classified as belonging to the class of 
scenes containing a certain vehicle, but m a different problem a certain 
scene with that vehicle may have to be classified as a sharp image, or as an 
image containing a vehicle with a certain number on its number plate, or 
driven by a certain driver 

The type of patterns and so the type of classes that are of interest at a certain 
moment to a certain observer is determined by his choice, this choice has to be 
made before a pattern recognition system can be developed We assume from 
now on that this choice has been made 
A fundamental question is what different representations of a certain pattern 
m a source are presumed to be indicated by a certain class'^ For instance, what 
different forms of the character *A’ have to be taken into account, and do 
ornamental characters have to be recognized loo*? This question is mostly very 
difficult to answer, it is related to the variability of the representations of the 
patterns of each class Variability of the representations of the patterns has 
many causes (Section 7 1 2) and is mostly not very explicitly formulated The 
pattern recognizer should analyse the variability and find attributes or features 
and decision methods to cope with the variability that is present 
As a pattern recognition system should map many different representations 
of a certain pattern, present in different soun^, into the same class, one may 
describe this pattern recognition problem as a man}-to one mapping 
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7.1.2 Variability 

The variability of the representations of the patterns belonging to a certain class 
is an essential difficulty in pattern recognition. Several causes may exist for this 
variability. 

Natural patterns from biological origin vary from specimen to specimen, and 
in humans and animals the shape or expression of one specimen can vary 
considerably as a function of time (consider face recognition). 

Cultural patterns, such as printed or written characters, are roughly deter- 
mined by human conventions, but conventions change, depend on personal 
interpretation, and are difficult to define. An artist may develop his own style of 
characters ; as long as they are recognized by (some) humans they are acceptable. 

In addition to these essential variabilities, the variability resulting from the 
treatment and preparation of the sources with the patterns may be important. 
The illumination of objects or the direction from which they are viewed greatly 
determine the input to the sensors of the pattern recognition system. The same 
is true for the way biological objects, like tissues or cells, are prepared (Figure 
7.1). 

Variability of the input data also originates from noise in the communication 
channel between the sources and the measuring instrument and from noise and 
imperfections of the measuring system. One specific source with a specific 
pattern may be led, by these causes, to many mutual different inputs to the 
pattern recognition system. 

The natural and cultural variabilities have to be seen as essential to pattern 
recognition. The preparation and treatment of objects, the communication 
channel, and the measuring system, however, are preferably chosen in such a 
way that the variability added is — if possible— small in relation to the essential 
pattern variability. This is another way of stating a general rule of measurement, 
that is, that the measuring system should not change the situation more than 
corresponds with the accuracy pursued. 

The situation in pattern recognition is quite complex because a great many 
types of distortions and noise may be present and the properties of the patterns 
may be quite different. If high spatial frequency details of an image are essential 
for the patterns under consideration the requirements are not comparable to 
the situation where the low-frequency shape of a curve has to be analysed. As a 
consequence, the preparation and treatment of the objects and the communi- 
cation and measuring process require great attention for each individual pattern 
recognition problem. This sometimes means that standardization of preparation 
and treatment techniques and reduction of noise or distortion influences may 
be necessary. It also means that specialized measuring systems for pattern 
recognition may be appropriate. This is one of the reasons why it seemed useful 
to add this chapter on pattern recognition to this handbook. 
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Figure 7 1 Chromosome number 1 from diffcrcnl 
individuals Although variability can result from in 
herited variations m the size of certain bands it is 
mainly caused by chromosome preparation and staining 
(Courtesy M v d Ploeg Leyden) 
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Lateron.howtodeal with variability will be discussed; two general approaches 
are; 

(a) estimate statistical parameters to describe variability; 

(b) use a priori physical knowledge of the sources, the patterns, the communi- 
cation channel, and the measuring instrument. 

7.1.3 Features 

The input data are often very redundant. An image of 256 x 256 picture 
elements with 8 bits per element contains more than 5x10® bits, which have 
to be reduced to a few bits indicating the classes involved. In addition to this, 
the input data show much variability. For these reasons it is necessary to look 
for attributes or features in these data that may be more or less characteristic for 
the patterns to be recognized and that are far less redundant than the original 
input data. Preprocessing the input data has to produce a limited amount of 
feature values as a basis for the decision process, which results in classes at the 
output of the recognition system. The type of preprocessing needed and the type 
of features suitable cannot be found in a straightforward way. Physical and 
a priori knowledge about the sources of variability is essential as a guideline. 
Statistical methods may be helpful to select features (see Section 7.2.2) but in 
addition intuition, experience, and/or trial and error are sometimes necessary 
to find suitable features. This determines in great part the difficulty of the pattern 
recognition process. 


7.1.4 Formal Description of Pattern Recognition 

Pattern recognition has been characterized in Section 7.1.1 as a many-to-one 
mapping; ‘many’ indicates that pattern recognition deals with many different 
but equivalent representations of a certain pattern. In a similar way to that used 
in Chapter 1, pattern recognition may now formally be described by a mapping 
m from a set of sources S,-, containing all different but equivalent representations 
of the pattern i, into a class C,-, the name or code for this pattern; this has to 
hold for all patterns i under consideration. With ‘~s,’ as indication for the 
equivalence relation on the set of sources S,- and ‘ = ’ the identity relation we have 
rn. Sj ~y C; must be a homomorphism of <5,-, ~s,) into <Cf, =), which means 
that if s,y and s;^ are two representatives of S„ so s^j, s,,- e Sf, and Sy ~ sfiv this 
implies that m(sy) = (Finkelstein, 1975; Verhagen, 1975b). 

If the set of sources S,- is mapped by nij (denoting sensing and preprocessing) 
into the set of features F,-, and further by mi (denoting decision, discrimination) 
into the set of classes C,-,the features have to provide the possibility of introducing 
an equivalence relation ~ on the set of features, which allows mapping of the 

eatures in a homomorphic way into the classes. Figure 7.2 shows the situation 
in a global wa}'. 
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Figure 72. Formal descnption of 
pattern recognition by sensing and 
preprocessing m, and decision m 2 
based upon an equivalence relation 
- f on the set of features F, 


Sources Input dato Features dosses 

( meosuring iJotol 

— — ^ Sensor I 1 ■ j Preprocessor J-C Clossifier j — 

Future 7 3 Block diagram of a pattern recognition system 

A b/ock diagram (Figure 73) represents, in a more technical language, the 
same idea as Figure 7 2 

7.1 .5 Tj-pcs of Patterns, Features, and Pattern Recognition Systems 
It IS difficult to describe the difTerenl types of patterns adequately, together with 
the related features and recognition systems, because there is a great variety of 
patterns A very rough distinction may be made as follows 

(a) Patierns that can be described by analog or digital lalues of a number of 
features The \ ariability of the patterns also causes v anabilily m the feature 
values This variability in the features can sometimes be handled by means 
ofstaiistical methods The correspondingmathemalical tools for allocation 
to the classes are called statistical discriminant methods (see Section 7 2.2) 

(b) Patterns that can be decomposed insubpatterns and even smaller subpatterns 
(the smallest of these subpattems are often called primitives) and that can 
be described by these subpattems or primitives and their mutual relations 
A line figure, for instance, can be comjjosed of straight and curved lines 
placed in certain relation lo each other The patterns sometimes can be 
expressed m a language generated by a grammar In that case linguistic 
methods (ako called structural or syntactic methods) can be used to describe 
patterns Recognition requires parsing after determination of the primitives 
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and the mutual relations. In some special cases the grammar needed for 
analysing can be found based upon a finite set of characteristic sample 
patterns. This process is called grammatical inference (Section 7.2.3). 

The patterns and the classes in which they are classified mostly concern 
discrete classes; a pattern belongs to a certain class or not. Fuzzy patterns, 
where each pattern can have a membership value between 0 and 1 of several 
classes at a time, currently receive much attention (Section 7.2.2). Sometimes 
an allocation to a continuous variable is possible by interpolation between 
classes. 

A learning set of representations of the patterns under discussion is often 
available, together with labels indicating the class of each representation. This 
set should be used during the development and training of a pattern recognition 
system to learn about the variability of the representations of the patterns and 
to develop some (optimal) classification scheme. A large and representative 
learning set facilitates such a development but, unfortunately, such an ideal set 
is seldom available. If no representations with a label of their respective classes 
are given, an analysis of the input data may show whether so called clusters can 
be found, indicating the existence of regularities that might be called patterns 
(learning without a teacher). 

Finally, one may distinguish between pattern recognition in a strict sense, 
where human recognition of visual, auditive or other patterns is automatized, 
and pattern recognition in a very broad sense, where all types of (even not yet 
known) regularities in data are taken into account and where pattern recognition 
techniques are used to analyse and classify the data. These data may have their 
source in such fields as the economy and scientific research. 

7.1.6 Pattern Recognition and Nominal Measurement 

The definition of nominal measurement is not unique. Originally (Stevens, 
1946; Ellis, 1966) in a nominal measurement one must only be able to distinguish 
whether two representatives of a certain entity are equal in a certain aspect or 
not. No order in the entities whatever is under discussion. A standard collection 
of entities with names or codes is often available (standard colours for instance). 
This allows one to compare unknown entities with the items of the collection 
and to determine which one of the collection, if any, equals the unknown. The 
name or code of the item of the collection that equals the unknown is given to 
the latter. So one only has to be able to determine identity (sameness) and 
difference. These words are sometimes repeated in recent papers (Finkelstein, 
1973, p. 18; ‘Measures on a nominal scale merely describe whether two entities 
are identical or different’), but ‘identical’ (and ‘equal’) are broadly interpreted, 
as nearly equal’, ‘similar’ and as ‘equivalent’ (in the sense of equivalence 
relation) (Finkelstein, 1973, p. 17). A relation with pattern recognition as 



284 


HANDBOOK OF MEASUREXfENT SCIENCE 


described m the previous sections will be evident nommaf measurement and 
pattern recognition share the baste idea of the equnalence relation Aspects 
like colour and hardness, are determined as equal or equivalent in nominal 
measurements, the aspect ‘pattern’ is regarded in the equivalence relation as 
used m pattern recognition 

The determination of equality, similarity or equivalence of items like colour 
and of visual or auditive patterns, as performed by human (and animal) per- 
ception seems to be quite easy as a product of evolution (and an immense 
learning set), but it is not well known yet how this works Automatic and objec 
tive determination of equality, similarity or equivalence is difficult, computer 
evolution has not been directed toward perception and association 

No measuring scale like a ratio, interval or even ordinal scale was available 
for traditional nominal measurements Much research was necessary to find, 
for instance, which physical quantities (with Iheir scales) had to be used to 
compare colours as experienced by man, objectively (luminance, hue, satura- 
tion) The situation to define objective equivalence measures for all types of 
patterns is still more difficult In the terminology of the preceding sections it can 
be said that features have to be found which characterize the patterns involved 
in a suitable way. they have to discard the information which is considered 
irrelevant to the problem and to reveal what is relevant 

It may be remarked that an artificial pattern recognition system that success 
fully performs the classification tasks of a biological system also constitutes a 
model for it Here lies a relation between percepuon and pattern recognition 
research 


7.1 7 Pattern Recognition and Measuring 

The discussion m the preceding section already indicated a relation between 
pattern recognition and measuring Another reason for this relation is that the 
input of pattern recognition systems often consists of physical measuring data 
and that, as treated m Section 712, the properties of the data acquisition system 
arc important for the pattern recognition system 
There is a further reason to stress the relation between both fields The 
ultimate purpose of a measurement is often not only to produce measured 
values of unknown quantities, the intention is mostly to interpret these values 
and to draw conclusions from them They may be scientific conclusions (an 
hypothesis is justified or not by a measurement) or technical conclusions (a 
certain situation is all right or is not, in the latter case a readjustment of the 
situation by feedback might be necessary) Conclusions that can be drawn 
from the measured values m a simple way (for instance, the value of a tempera- 
ture is above a threshold) are usually not described as a pattern recognition 
process But in those cases where complex data and a complex analysis of the 
data are necessary in order to draw conclusions, and are such that pattern 
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recognition methods are necessary, it will be clear that pattern recognition is 
essentially linked to the measuring process. 


7.1.8 Pattern Recognition and Other Disciplines 

Pattern recognition is not only connected with the measuring field, but also 
with many other fields, and this for two reasons: other disciplines apply auto- 
matic pattern recognition methods in order to solve some of their own problems, 
and pattern recognition borrows knowledge and methods from other fields 
(Verhagen, 1975a). In addition to some disciplines already mentioned before, 
like statistics, linguistics, and physics, (including measuring) pattern recognition 
uses knowledge from, for instance, mathematics, computer science, information 
theory, artificial intelligence, and perception science. 

The following survey (current in 1979) tries to give some idea about fields 
where pattern recognition can be applied. A few examples per field are given 
and some short comments. In Verhagen et al (1980) some more details and 
literature references for the examples are given. 


Medical field 

Examples in the medical field are: diagnosis; analysis of cardiograms, and 
encephalograms; chromosome, blood cell, uterus cell or tissue classification; 
analysis of echocardiograms and other echograms; analysis of X-ray images to 
determine tumours, obstructions in blood vessels, shape of stomach wall. A 
great difficulty is to find features allowing a discrimination between normal and 
abnormal. Several systems are used in practice, but many more are still under 
development. It appears to be difficult to compete with the flexibility and 
reliability of humans, and often human labour is less expensive than automatic 
systems. Interactive systems are popular, allowing humans to take care of 
difficult and ill-defined situations, while the automatic system performs the 
routine and bookkeeping activities. Automatic systems have a lead if they can 
produce numerical values for interesting phenomena, for humans cannot do 
that. 


Recognition of cultural patterns 

Reading characters for administration and banking, for the blind and elderly 
people with decreasing vision; search and selection of drawings; information 
retrieval are further examples. Reading stylized and printed alphanumeric 
characters of several fonts is a problem well solved in practice. Handprinted 
characters from a limited number of penmen, written with certain restrictions, 
can be read with much success (95-99.9%, depending upon the restrictions 
and number of penmen). The success is substantially lower when no restrictions 
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are demanded and very many people are involved The problem of reading 
continuous handwriting is not yet solved 


Recognition of human eniironment 

Fingerprints and footprints, scene analysis including identification of objects, 
speech and speaker recognition, remote sensing for geological data, status of 
the crop, pollution detection, production of maps, inventory of (sub)urban sites 
also present pattern recognition problems The recognition of isolated words 
from a relatively small vocabulary (a few hundred words) and from a few 
speakers is possible with a high percentage of success (95-99%) the greater 
the size of the vocabulary and the number of speakers the lower the recognition 
rate Continuous normal speech is only understandable automatically for 
special situations (programming a computer by a few people) Remote sensing 
applications are practical for specialized purposes 

Scientific research 

Analysis of bubble-chamber images, flmisiones, radioactivated mud, geological 
structures, meteorological data, images from materials science provide examples 
here Relatively simple tracks and events in bubble chamber images can be 
determined with a reasonable amount of success Specialized systems are often 
used to determine quantitative data from thin material slices, etc 

Industrial field 

Quality inspection of (fast) moving surfaces of steel, paper, banknotes , recogni- 
tion of objects for sorting, quality control, and assembling, fidelity of loud- 
speakers , analysis of blurred X-ray images and radiograms , testing of materials, 
detection of abnormal, unsafe situations (such as of a nuclear reactor) he in this 
class The applications of pattern recognition m industry are increasing but are 
still in their infancy Section 7 7 deals with some reasons for this situation (see 
also Verhagen, 1977) 


Military field 

Examples m this field are detecting military targets, planes, steamers, etc Not 
many details have been published in the unclassified literature 

7.1 .9 Contents of this Chapter 

Most attention will be paid to sensoK and (pre)processing for two-dimensional 
images within the framework of pattern recognition Section 7 2 gives a survey 
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of some important pattern recognition techniques; given the space available, 
only a general discussion without details is possible. 

Section 7.3 is devoted to several types of sensors for two-dimensional images. 
The data produced by these sensors often have to be (pre)processed, if possible 
on-line. Some requirements and some processors especially suited for pattern 
recognition and image processing systems will be treated in Section 7.4. 

The communication channel and the measuring system introduce distortions 
and noise in the input data of a pattern recognition system. The point-spread 
function of an optical system and the coupled electronic system (for example, 
as found in a TV-scanner) will blur the image and noise may be added. Section 
7.5 discusses some restoration techniques like inverse filtering and noise reduc- 
tion for two-dimensional data structures. In order to find features or to enhance 
them, special processing may be useful both for human observation and for 
automatic recognition; Section 7.6 is devoted to these activities. 

Finally, some trends in the field of pattern recognition and the subjects treated 
in this chapter are given in Section 7.7. 


7.2 SURVEY OF PATTERN RECOGNITION TECHNIQUES 


7.2.1 Introduction 

Many mathematical and heuristic techniques are used in pattern recognition, 
depending upon the problem. In accordance with Section 7.1.5 the rough 
distinction between statistical and linguistic methods will be continued. Only a 
short survey will be given, indicating some major methods used in pattern 
recognition. A slightly more extensive treatment is given in Verhagen et al (1980) 
from which this survey borrowed heavily. 

Many books on pattern recognition give detailed information, for example 
refer to Fukunaga (1972), Young and Calvert (1974), Bock (1974), and Stein- 
hagen and Fuchs (1976) for statistical methods, Fu (1974), Fu (1977), and 
Gonzalez and Thomason (1978) for linguistic methods, and Duda and Hart 
(1973) and Fu (1976) for both; Batchelor (1978) discusses applications; 
Rosenfeld publishes a yearly survey on image processing in the journal Computer 
Graphics and Image Processing. 

7.2.2 Statistical Methods 

The starting point is a set of analog or digital values of k measurements from 
sources or preprocessed measurements giving k features that may be relevant for 
the patterns involved. A k-dimensional vector space can be constructed (feature 
space) where each dimension represents a feature. A point in this space indicates 
a representation of a pattern in a source. The variability of the representations 
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Figure 7 4 Example of a three-dimensional feature space 
x, xj X 3 with four patterns in it each pattern produces a 
cluster of points which indicate equisalent representations 
of a certain pattern 

of the patterns (Section 7 I 2) means that equivalent representations of a certain 
pattern will be indicated by different points in the feature space (Figure 74) 

Intuitnely one expects that sources with equivalent representations of a 
certain pattern will be situated ‘near’ to each other in clusters (compactness 
h} pothesis), and that different patterns be ‘far’ from each other In order to use 
notions like ‘near’ and ‘far’ one has to debne a distance measure Many distance 
measures are m common use (Kanal, 1974), the simplest being the Euclidean 
distance for a continuous space and the Hamming distance (number of different 
bits) for a binary space 

Only in an ideal case will the points representmg equivalent patterns be close 
to each other in small clustered regions as is shown in Figure 7 4, and the points 
representing non-equivalent, different patterns, be in remote regions, this also 
indicates that really relevant features are chosen In most cases the clusters 
o\ erlap, ha\ e irregular shapes, and are only given as a set of points for a learning 
set (Section 7 1 5) 

A still more difficult situation exists when no labelled points of a learning 
set are gi\ en, and one has to analyse the positions of points m the feature space 
to see whether some regularities may be found This situation is similar to the 
analysis of measuring data (now taken as features) ongmated by a physical 
experiment or of economic data Sometimes it is even not known how many 
clusters, if any, are available Now cluster analysis may be helpful Many cluster- 
ing methods ha\ e been developed m many sciences for many purposes (Jardme 
and Sibson, 1971, Exeritt, 1974, Bock, 1974, Hartigan, 1975) Subjective 
judgments, especially when the number of clusters is not known beforehand, 
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Figure 7.5 Dendrogram. Samples 2 and 8 have the smallest 
distance; a clustering in two groups gives the clusters (2, 8, 5, 1, 4, 3) 
and (6, 7, 10, 9); a clustering in three groups gives the clusters 
(2, 8, 5, 1), (4,3), (6, 7, 10, 9), and so on 

are part of the procedure. A simple and often used method may illustrate this 
point. Firstly, a distance measure has to be chosen, guided by a priori knowledge, 
intuition, and/or trial and error. When n fc-dimensional data vectors are available 
(so providing n points in a /c-dimensional feature space), the simplest statement 
is to interpret them as n clusters. Next, the two points with the shortest distance 
are fused to a new cluster, giving n — 1 clusters. This procedure of combining the 
two clusters with the next shortest distance can be repeated until all the points 
form one cluster. To define the distance between clusters with more than one 
point one has to make a choice as to whether the distance between the means of 
the cluster points has to be taken, or the distance between the two nearest 
points in the clusters, or some other distance. The dendrogram of Figure 7.5 
illustrates this method; the duster fusions made successively are indicated. The 
height of the horizontal connection lines gives a measure of the distance between 
the clusters involved. The final degree of clustering has to be chosen again by 
subjective judgment. 

From now on we will start with a learning set consisting of sources with the 
labels of the classes into which the patterns in the sources have to be classified. 
In the ideal case mentioned above, with small clusters per class and the clusters 
of different classes far away and not overlapping, a very simple recognition 
procedure is possible: classify an unknown source according to the cluster in 
which its point in feature space lies. Here it is taken for granted that the learning 
set is representative for all patterns, and that the clusters can be defined from 
them. 

In practice, however, the clusters often overlap. When a statistical approach 
is used the values of the features are regarded as stochastic values because of the 
variations produced by all sources or variability (Section 7.1.2). The method 
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Closs 1 Class 2 


Figure 7 6 Probability density functions / (x) for two classes 
and one feature x x^ is the optimal discnmmatmg value, the 
cross hatched surface gives a measure for the minimum error 


applied depends upon the knowledge available If for each class and for each 
feature the probability density function would be known, together with the 
a prion probability of all classes and the ‘loss* if a sample is classified in a wrong 
class, statistics gives the methods to determine optimal discriminant functions 
between classes, where optimum means a minimum expected loss A very special 
case for two classes and one feature x, with equal a priori probability and equal 
loss, IS indicated in Figure 7 6 In the region where the two density functions 
/i(x) and /aCx) overlap, for instance at point x„ the best decision one can make 
according to the Bayes classification rule is to assign a sample to the class with 
the highest value of the probability density, in this case class 1 (Fukunaga, 
1972) The discrimination value x^ is that value of x in which the two densities 
are equal The cross-hatched surface is a measure for the minimum error If the 
a prion probabilities are p, and pj for the two classes, and if the loss when a 
pattern of class 1 is classified m class 2 is In, and when a pattern of class 2 is 
classified m class 1 is /jj, the best decision (minimum cost) is to assign a sample 
X to class 1 if Pi/ 2 i/i(x) ^ P 2 fi 2 / 2 W« the discriminating value Xj now lies 
where P 1 I 21 / 1 W = P 2 ^ 2 / 2 W Points to the left of Xj are assigned to class 1 
and points to the right to class 2 

For «-dimensional feature spaces similar procedures can be applied For 
normally distributed density functions., quadratic dis criminan t functions are 
the result, linear discrimination functions are obtained in case the covariance 
matrices of the classes are equal 

The probability density functions and the a prion probabilities are mostly 
not known m practice, but have to be estimated by using statistical estimation 
methods If the type of the density function is known, the estimation concerns 
the parameters of the function from the learning sample values If the type is 
unknown a guess may be made, or some non-parametric estimation technique 
may be used The Parzen estimation, by which each object m the feature space is 
represented by a kernel and the class density estimation appears as the average 
over the learning samples, is quite popular (Fukunaga, 1972) The kernels may 
be normal density functions, uniform distributions, and others In addition to 
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Figure 7.7 Estimation of the density function 
from learning samples, using Parzen estimation 
with normal distribution and different width 

the kerneltype, parameter values (for instance, width) have to be chosen— a 
difficult job. When, for instance, for a one-dimensional example (Figure 7.7) the 
normal densities are small, a very peaked estimate will result, broad-shaped 
densities produce a very smooth distribution. Several optimizing techniques 
using different criteria are possible (Duin, 1976). 

The discriminant function between two classes again is determined by the 
above formulation, giving non-linear functions; a linear approximation may be 
useful (Specht, 1967). 

A much more heuristic method does not estimate the density functions but 
immediately refers to the intuitive idea that equivalent patterns lie near to each 
other (of course with a distance measure defined). Now an unknown sample is 
assigned to the class of the nearest neighbour among the learning samples, or to 
the class of the majority of a certain number of neighbours. 

Another heuristic method starts with an adopted type of discriminant function 
(for instance, a linear function) and experimentally adapts its coefficients 
sequentially in order to get the least wrongly classified learning samples as 
possible. Systematic procedures producing convergent results have been de- 
veloped (Nilsson, 1965; Minsky and Papert, 1969; Mendel and Fu, 1970). 

In the preceding methods all features were taken into account at the same 
time. Some other methods use the features successively. This can be represented 
by a decision tree, where in each node a special feature is analysed. Each node 
determines the choice of one of the branches which spring from that node. The 
classification occurs at the bottom; the depth of the branches may be different 
for several parts of the tree (Figure 7.8) (Fu, 1968; Kanal, 1974). 

When a discriminant function has been determined by one method or another, 
It has to be tested concerning the classification error it produces. To use, once 
again, the points of the learning set and count the number of wrong classifications 
IS not advisable ; a too optimistic result will be obtained because the discriminant 
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Figure 7 8 Deasion tree, circles denote 
nodes with decision based on feature x, 
crosses denote the end point of classification 

function IS tailored for this set It is better to use an independent test set but 
also one with labelled points Now the result is too pessimistic, if afterwards 
a discnmmant function is determined by using both the learning and the test 
set (and it is a pity not to use this set also for learning because the greater 
the learning set, the better the result) When all but one point of the learning set 
are used for determining the discrimination function and the one for testing, 
and this one point successively takes the place of all points of the learning set, 
an unbiased estimate for the error is obtained, though with a large variance 
More methods exist (Toussamt, 1974) 

It IS not always necessary to classify all points into one of the classes If a point 
lies far from one of the clusters or quite near a discnmmatmg surface one may 
reject this point, so decreasing the possibility of makmg faults A human 
decision may be made now or the sample may be left out 
Statistical methods may also be used to find important features among the ones 
ongmally used or to find mterestmg linear combinations of the features involved 
A simple method determines for each feature the difference between the class 
means and compares this distance with the vanances of the classes A good 
feature has small variances m relation to this distance (refer to Figure 7 9) In 


/(^} ^f-r) ry) 



Good feature Bod feoture 


Figure 79 A good feature has small variance in relation to the distance between 
class means 
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this approach the relations between features are not taken into account. So, 
when such a relationship exists it is not always true that the m best individual 
features are the best m ones. More sophisticated methods have been developed 
for those cases (Kanal, 1974; Narendra and Fukunaga, 1977). 

Finally, an interesting point has to be mentioned that is also important for 
measurement strategy. It concerns a relation between the number of features 
(measurements) and the size of the learning set. A small learning set does not 
allow one to estimate with a reasonable accuracy many properties of the classes ; 
so only a few simple features are feasible. It can be proved that with a given size 
of the learning set an optimal number of features exist. Both a smaller and a 
larger number of features give worse results (Duin, 1978; Campenhout, 1978). 
This ‘peaking effect’ also indicates that it might not be useful to draw relevant 
conclusions from measuring more and more variables. 

The fuzzy set approach can also be applied for pattern recognition purposes. 
Fuzziness can be introduced at many levels: the labels may be fuzzy, and/or the 
feature values, the classification results, the clusters, and the models may also 
be fuzzy. Many methods are being brought into existence (Backer, 1978; 
Gaines and Kohout, 1977). Indicating a membership value to the class labels 
for all sources of the learning set (a certain member is a rather typical representa- 
tive of a certain pattern or has only a low membership grade) introduces more 
information in the learning set than when only indicating the class; it is to be 
expected that better classifications can result. The way the membership value is 
assigned, however, is often quite subjective, just as are (like in other methods) 
the distances defined in a feature space or the classification rules chosen. Fuzzy 
methods may bea worthwhile help; until now, however, they have not produced 
a final solution for pattern recognition. 

7.2.3 Linguistic Methods 

The features in linguistic pattern recognition are, according to Section 7.1.5, the 
primitives and their mutual relations. A pattern grammar describes the relations 
in an image, just as in linguistics a grammar describes the structure of a sentence. 
A few rules from a linguistic grammar are: a sentence consists of a noun phrase 
followed by a verb phrase; a verb phrase consists of a verb followed by a noun 
phrase; a noun phrase consists of an article followed by a noun phrase, or of a 
noun followed by an adjunct or of a noun, etc. Figure 7.10 gives an example of a 
pattern grammar (Shaw, 1969, 1970); Figure 7.10a shows the primitives, con- 
sisting of directed edges of a graph pointing from its tail node to its head node; 
the mutual relations between the primitives or conglomerations of primitives 
are defined in Figure 7.10b; Figure 7.10c represents some rules for generating a 
pattern with S as a start symbol and A and B as auxiliary symbols; with these 
rules the sentence of Figure 7.10d is derived as follows: S => (S + S) => 
{{A *B) + (A* B)) (((a + b)*{b + a)) + {{a + b) * (b + a))). 
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o-*b heod of a connecled to toil of b 
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[{[a* b)*[b*a))*{{o*b)*{b*-o))) 


Figure 7 10 Example of a pattern grammar (a) pnmitives, 
(b) definition of relations between pnmit>%es or conglomera 
ViWft xfi -pniiTrftT^es stmit Tu’itt lot geiieraTmg a paiVwTi, 
(d) example of a pattern generated by these rules 


Many different types of grammars and pnmiti\es came into existence with, 
for instance, primitives consisting of pixels instead of bnes and with other types 
of rules to handle pixels and hues than in the example There is a great abundance 
of theories about different types of grammars related to the types of rules used, 
for instance, whether the rules are context-sensitive or context-free, whether 
only speaal kmds of rules are allowed or whether no restrictions are present 
(Salomaa, 1973) 
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A grammar may be derived from a priori knowledge of the images involved, 
for example, of well written characters, but sometimes an analysis of a number 
of representative samples of the patterns may provide a grammar (grammatical 
inference) (Bierman and Feldman, 1972; Gonzalez and Thomason, 1978). The 
easy incorporation of a priori knowledge in linguistic pattern recognition is a 
great advantage. 

A priori knowledge and intuition are important for selecting primitives; 
simple and small primitives need many rules to compose an image, but allow the 
formation of many images; more complicated and extended primitives may 
facilitate the composition of certain images in an easy way but their structure 
is restricted. It can be difficult to extract primitives from scanned images. Hard- 
ware solutions to follow lines and segment them into primitives are reported 
(Wolff, 1977); software algorithms may be used to find primitives in digitized 
images. 

Recognition first of all requires finding primitives and their relations. Then 
whether the pattern agrees with a certain grammar has to be analysed. The 
pattern has to be rejected if it does not result from the grammar involved. Two 
methods may be mentioned. In bottom-up parsing one starts with the description 
of the image by the primitives and the relations found, and by applying the 
rules of the grammar in a reversed direction one has to look for consistency with 
the rules of the grammar under discussion. A general procedure for this purpose 
is given by the theory of automata (Hopcroft and Ullman, 1969; Salomaa, 
1973). Images accepted by automata agree with the different types of grammars 
indicated before. 

Top-down parsing generates images in a directed way till the one to be re- 
cognized is composed. A special and simple, but often used procedure uses 
decision trees with a structure like that described in the previous section 
(Figure 7.8), although at each node there is not a statistical decision procedure, 
but a decision as to whether a certain rule is obeyed or not. At a certain node one 
has to decide, for instance, whether two primitives are connected head to tail or 
not; the next node may ask for a decision as to whether some other primitives 
have a specific relation or not, and so on. 

For further literature on parsing see Aho and Ullman (1972, 1973), Fu (1974), 
and Gonzalez and Thomason (1978). 

Until now, only well defined, deterministic images have been discussed. 
Stochastic grammars are appropriate for distorted images if well defined 
distortions are present. The removal of noise and distortions by image restora- 
tion (Section 7.5) is advisable before parsing. This removal will not be ideal, so 
in general, no exact match with the grammar will be found. This difficulty may 
be reduced by allowing some mismatch (Fu and Lu, 1977), but efficient pro- 
cedures are hard to obtain. For corrections of errors in contex-free languages 
see also Thomason (1975). 
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13 SENSORS IN RELATION TO PATTERN RECOGNITION AND 
IMAGE PROCESSING 


73.1 Introduction 

A sensor in a pattern recognition system (refer to Section 7 1 4) connects the 
sources to the input of the recognition s>stem, with its preprocessing and 
classification functions The sensor produces measuring data, but often some 
preprocessing of these data already lakes place m or near the sensor 

Quite often no special sensors are developed for a pattern recognition system, 
for instance normal microphones are used for speech recognition or speaker 
identification, cardiographs are used for electrocardiogram or\cctorcardiogram 
analysis, and normal X-ray ultrasonic, microwa\e or nucleonic systems are 
used to produce images from technical or biological sources that are suited to 
image processing and recognition purposes As most pattern recognition and 
processing systems use digital techniques, the addition or insertion of analog to- 
digilal con\erters, however, is necessary 

In some cases, a modification of existing sensor systems or the addition of 
some data processing in or near the sensor is \ery useful for pattern recognition 
purposes, especially when images are involved Some examples are TV cameras 
with added flexible digitization and storage facilities, flying spot scanners with 
facilities for line following or scanning with different types of grids (rectangular 
or hexagonal) and one- or two-dimensional semiconductor photocell arrays 
with shift and memory facilities 

Finally, more specialized devices, such as laser-beam scanners or light-plane 
scanners, that scan objects m a predescribed way have been developed for 
pattern recognition purposes m order to gel data in a suitable way, or such as 
distance measuring systems (using lasers or acoustics) in order to gel data 
about the depth dimension of objects 

This section discusses some sensors for two-dimensional scanning of objects 
or images often used for pattern recognition purposes Because no special 
aUeniJOD is gi\exi ip iMip-d>mej}SM>na} ^.^xials ao t.be pihpx nf this 

handbook, systems m this field not specialized for pattern recognition or without 
special modifications for this purpose are briefly treated as well The same 
reason also justifies Sections 74, 7 5, and 7 6 of this chapter because in the 
other chapters mostly onc-dimensional. often time-dependent signals are 
treated, it is appropriate to pay some attention here to two-dimensional, space- 
dependent signals and their processing, where quite often similar, but adapted 
methods are used as with one-dimensional signals 

The systems to be discussed in this section all use some kind of radiation, for 
example, optical, ultrasonic or nucleonic radiation Only contact by radiation 
lakes place This implies that, in practice, often a negligible or only a slight 
influence or distortion of the source takes place, and it also allows having a 
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distance— sometimes a long distance— between the source and the sensor. This 
last property allows the collection of data from aircraft or satellites (remote 
sensing), from hot and— for man— dangerous places. 

The radiation involved may originate in many different ways. The objects in 
the pattern source may produce the radiation itself (hot surfaces, infrared 
systems); or they may be illuminated and the reflected radiation used. One 
may choose between parallel measurement (like the eye) and sequential scanning. 
Scanning may be performed at the illumination side, for instance a point or 
light-plane scanning of the object (Section 7.3.6), at the detection side (like TV) 
with an illumination that is as homogeneous as possible, or at both. 

Not all two-dimensional scanning systems could be treated; no attention has 
been given, for instance, to remote-sensing systems, side-looking radar and 
infrared scanners. 

Important specifications for imaging systems are: spatial resolution (in both 
vertical and horizontal directions and depth) and number of pixels (picture 
elements) in these directions ; speed and method of scanning and the property 
of adapting speed, for example, slow scanning to reduce noise; sensitivity as a 
function of space (shading) and range; accuracy; dark current; linearity (both 
as to space and intensity); noise (both in time and space); flexibility and price. 

7.3.2 Mechanical Scanners 

A classical mechanical scanner is the moving-stage microscope (for example, the 
Zeiss SMP cytophotometer) (Ploeg et al, 1974). The microscope stage with the 
preparation to be scanned, is driven by stepping-motors under computer 
control. Stages with steps of 0.25 pm are available. The transmitted light is 
measured by a photomultiplier and digitized into eight or more bits. The size 
of the measuring spot is determined by a diaphragm. A speed up to 1,000 points 
per second is possible. 

A drum scanner uses a rotating drum; an image, for instance a photograph, 
tightened on the drum, is scanned sequentially by a photodiode, which moves, 
together with the illumination, in the direction of the axis of the drum. The 
aperture may be adjustable, say between 25 and 200 pm. The photocurrent is 
digitized into, for example, eight bits. High discrimination, stability, and 
reproducibility can be obtained, together with a good linearity, and without 
shading. The number of points to be scanned in 1 s is of the order of 10,000- 
20,000. The accuracy, and the price, depend on the quality of the mechanical 
construction and of the optical system. 

7.3.3 TV-scanners 

Television is the most experienced and widespread technique for conversion of 
imagesintoanalogelectricalsignals. TV-cameras are produced in large quantities 



298 


HANDBOOK OF MEASUREMENT SCIENCE 


and in many qualities, always at a relatively low price This holds true both for 
the classical electron-beam scanners and for the photo-arrays that have more 
recently entered the television field The former will be discussed shortly in this 
section the latter m Section 7 3 5 

In a TV scanner the source is projected onto a plain target where sequential 
scanning takes place The scanning goes according to a sequence of equidistant 
lines distributed over two frames in such a way that the lines of the second frame 
lie between those of the first frame The process of the two-frame interlaced 
scanning is repeated 25 or 30 times a second In between the lines and the frames 
synchronization signals are inserted The result of the scanning is a one-dimen- 
sional time signal with information concerning the light situation at the target, 
interrupted by synchronization signals The lime between two scans at a small 
region of the target— this region to be considered as a small capacitor— is 
^ or ^ s, during this time the capacitor previously charged by the electron 
beam, is discharged with a current originated by the photoelectric effect and 
related to the light situation at that region The resulting voltage of the capacitor 
depends on the integral of that current during the time mentioned (Polder, 
1967) Fluctuations during this time and noise are reduced by the integration 
process 

In order to process the images produced by a TV scanner, mostly the arising 
analog signal with information has to be digitized often some analog high 
frequency filtering is applied before to reduce noise analog preprocessing is 
sometimes applied, such as gamma correction, cnspening or pre emphasis 
(Fink 1957) The analog to digital conversion to digitize on-line must be quite 
fast if during one line time many conversions have to lake place With 25 scans 
of two frames per second and 625 lines in the two frames the line time is about 
64 /iS, 12 /IS of It is suppressed for synchronization purposes Sometimes only 
part of the line is used for the image in order to obtain a square image A sampling 
frequency for 512 points per line of at least 10’ Hz, and for 256 points per line 
of at least 5 x 10® Hz is necessary Converters with 1 up to 8 bits are used 
though the least significant bits are not always significant If 512 lines are used 
and 512 points per line with 8 bits per point, about 1 3 x 10® bits would arrive 
per frame pair and about 32 x 10® bits per second This speed is too high to 
communicate with most general purpose computers Moving images require 
special purpose preprocessing or less points and bits per point 

Standing images require only one pair of frames, and with 256 lines only 
one frame Many frames can also be used with digitization of only one or some 
columns, this allowing the use of a much slower converter and communication 
channel to the computer 

Instantaneous shots of moving images require a fast converter together with 
a fast memory able to accept the data on-line during scanning of one or two 
frames This memory may also be used as working memory of a special fast 
image processor (Gerntsen et al , 1977) as a buffer to allow a computer to gather 



PATTIiRN RnrOGNlTION 


299 


its contents according to its own speed, and at the same time it may be used for 
permanent display of the captured digitized image. 

A TV-scanner produces several types of distortions and noise. The optical 
and electronic parts have a point-spread function. Shading is a space-dependent 
type of point degradation caused by the position-dependent sensitivity of the 
target. Time-dependent noise is produced by fluctuations of the intensity, target 
noi.se. and electronic noise (especially the first amplifier). A longer frame time, 
allowing slower scanning, can reduce the first two types of noise. Special low- 
noise amplifiers for the first amplification step reduce electronic noise. Averaging 
repeated shots reduces all types of time-dependent noise. Shading may also be 
caused by inhomogeneous illumination and by a non-ideal optical system. 
Shading correction is possible in hardware (both analog and digital) or in 
software; the signal obtained from a blank background with the same illumina- 
tion is used to compensate for shading. 

The .scanning is deterministic as to the succession of scanned points, so no 
addresses of individual points arc necessary. Two-dimensional filtering (Sections 

7.4 and 7.5) requires points from several lines. In such cases it is advisable to have 
(random) access to thc.se points after scanning via some convenient means, for 
instance, by temporarily storing some lines of points in a memory. 

Many commercial digitized TV-scanners with divergent facilities and quality 
arc offered for sale (Hougardy, 1976). 

7.3.4 Flying-spot Scanners 

A flying-spot .scanner (Golab el ai. 1971 ; Eccics ct al., 1976a, b) uses sequential 
scanning of the object at the illumination side. The spot of a high-quality cathode 
ray tube, having a flat window, is focused by a lens onto a transparency or 
photographic negative to be analysed (Figure 7. 11) or onto an opaque surface 
having varying reflectivity. A photomultiplier measures the light transmitted or 
rellcctcd. .As any position of the object may be selected without a fixed order of 
scanning one has random access. In the ease of digitally coded positions, a fixed 
scan grid is mostly used. This may be square, rectangular or hexagonal, and its 
size and pilch can be chosen at will. The size and shape of the sample area can be 
influenced by electronic or optical means, for instance, by focusing or by intro- 
ducing an astigmatism effect. Like TV systems the flying-spot scanner has a 
position-dependent sensitivity (shading) which originates from CRT screen 
inhomogeneity and geometrical and optical characteristics. Part of these error 
sources can be compensated by continuous monitoring of the spot brightness 
by a compensation photomultiplier and by using, for instance, this information 
as a reference of a (huil-slope A!D converter which converts the photomultiplier 
current to time-interval length (Figure 7.12). If this length is measured by 
counting the number of periods of a high-frequency generator a digital value of 
the transmitted or reflected light is obtained. Tlic current of the compensation 
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photomultiplier may be integrated for a certain time in the dual-slope device 
and brought back to zero by the current of the mam photomultiplier This has 
the additional advantage that approximately equal amounts of photons are used 
at different intensities of transmitted or reflected light such that the inherent 
Poisson noise is independent of intensity Consequently, the scan speed now 
vanes from point to point according to the density or reflectivity value to be 
measured and the accuracy desired Typical scan times range from 10 fis to 
1 ms per point fori %accuracy Noiseconsiderations are discussed in Billingsley 
(1975) 

Spatial resolution depends on CRT spot size, optical magnification, and lens 
quality Typical spot size is 40 /im on the screen and 1000 x 1500 sample points 
on a 24 X 36 mm^ negative 


Voltoge on mfegrotor 



< Fixed time interval X Time intervol to be counted > 


— Time 


Figure 7 12 Dual slope A/D converter 
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In the random-access mode the scan can be made to follow lines or edges as 
they appear in the object. This may bring about useful data reduction directly 
at the sensor stage (Wolff, 1977). 

The flow of information may also be reversed when the object is replaced by 
photosensitive material such that the flying-spot scanner constitutes a hard copy 
device. Here the time of illumination depends upon the density desired. 

7.3.5 Photo-arrays 

Semiconductor technology has provided solid-state image sensors (Melen, 
1973; Purll, 1976). Combining photodiodes in an array was a first step. Inte- 
gration of the scanning function has followed since. The charge-coupled device 
(CCD) image sensor is a typical example (Howes and Morgan, 1978). Here the 
light-generated charges are transferred through carefully organized electro- 
static peristaltic movements in semiconductor channels to end up in an high- 
impedance amplifier at the output. On the way the charge may become lost 
or mixed up with charges from other photodiodes. Special buried-channel 
techniques are applied for better insulation but these still fail when leakage is 
caused by intensity overflow for which blooming results. In order to prevent 
smearing the channels are often also shielded from light and thus prohibited 
from acting as photodiodes. 

The simplest CCD consists of a linear channel lined with a row of photodiodes. 
These collect photons over a time period building up their charges until they 
are simultaneously dumped into the channel through which they are lumpwise 
transported to the output. Even such a simple configuration as this line scanner 
is useful in many industrial applications, especially when the product to be 
inspected passes the scanner on its way through the manufacturing process. 

Typical diode size is 20 pm x 20 pm and resolution is of the same order. 
Array lengths vary from 256 to more than 1000 elements. Shading effects may 
have a range of less than 10 %. Calibration of a few single elements can give 
considerable improvement. Noise is typically a few hundreths of a per cent of 
maximum intensity allowed. Acquisition rates well in excess of TV-speeds are 
obtainable under favourable conditions. 

Two-dimensional CCD scanners for television duties have been constructed. 
Array sizes range from 75 x 120 to 256 x 320 elements. Here the transfer- 
channels compete for area with the photosensitive elements. At least two read- 
out methods are in use. The first (interline transfer, ILT) employs a number of 
linear channels as described above. In order to create read-out time, interlacing 
is achieved by alternately dumping charge from one out of two sets of photo- 
diodes in a channel in between the two sets. The second method (frame transfer, 
FT) has linear channels that act as photodiodes at one half of the surface of the 
scanner, but are covered at the other half. Charge dumping now takes place by 
a quick shift of the charge train from the first half into the dark, to be further 
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shifted to the output when read-out demands Here it is separation of charge 
lumps that forbids simultaneous charge collection m the whole area 
Semiconductor and insulating materials are not primarily chosen for their 
optical properties The silicon substrate would be preferable in that respect and 
attempts have been made to thin the chip for back phase imaging Good spectral 
sensitivity between 300 nm and 700 nm can be obtained 


7.3.6 Special Optical Scanners for Three-dimensional Structures 

The scanners treated above produce 2-D projections from 3-D structures 3-D 
information can be found from one 2-D projection using distance cues (shadows, 
perspective, and other a prion knowledge), from more than one 2-D projection 
taken in different directions, and using stereopsis methods (stereology, a field 
intensively treated by the International Society for Stereology (Carpenter)* 
3-D scene analysis attracts much attention (Shirai, 1978) 

A number of scanners with a special illumination strategy and/or range- 
finding facilities have been developed especially for robot systems in order to 
produce data about the third dimension of more or less simple structures in an 
easy way An example of such a scanner uses a light beam with known origin and 
changeable but known direction that scans 3-D structures, together with a TV 
system that determines where the beam is reflected at the structure As point-by- 
point scanning takes a considerable time, a plane of light (also called ‘sheet’ or 
‘silt’ of light) IS generally used, that is, for instance, shifted in parallel by equal 
distances or is rotated around an axis with equally spaced angles thereby 
scanning the 3-D structure A TV system is used again to determine the position 
of the reflected line positions (Shirai and Suwa, 1971, Rocker, 1974) A faster 
method uses the projection of many simultaneous parallel or rotated light 
planes or of a square grid on the structures (Will and Pennington, 1971) The 
evaluation of the television signals is then more complicated than with one 
light plane Figure 7 13 explains the situation where a light plane is shifted 
m parallel across a simple object, a tetrahedron on a horizontal ground plane 
The light plane is perpendicular to the ground plane Intersections of the light 
planes and a plain tetrahedron surface are parallel, equally spaced, straight 
lines with directions and distances depending upon the angle between the hght 
plane and the surface A television camera looking at the tetrahedron receives 
data from the intersections about the position of the surfaces Begin, end, and 
breakpoints of the straight lines are determined in a relatively easy way from 
the video signal From this kind of information, together with data about the 
position of the light plane, the position of the surface can be computed Curved 


•AM Carpenter. Secrelary^reasurer International Soaety for Stereology lU-NWCMed Ed 
3400 Broadway, Glen Park, IN 46408. USA 



PATTERN RECOGNITION 


303 



Figure 7.13 Tetrahedron scanned by parallel light 
planes. Intersections of the light planes and two surfaces 
of the tetrahedron are indicated 


surfaces give curved, and not equally spaced intersections, allowing, by means 
of more complicated computations, determination of information about these 
surfaces. 

The light planes may be generated in several ways. For instance, by light-plane 
projectors rotated by motors, by standing light-plane projectors using rotating 
mirrors, by laser beams shaped to light planes together with laser-beam de- 
flecting devices. Instead of a television scanner a photocell array can also used 
(Kieszling, 1976). 

A different approach uses a type of radar range-finding technique (radar is 
used for long distances) using a laser beam, and measures light flight distances 
for the different light paths by means of time measurements during the scanning 
of the structure surfaces. Pulsed laser methods require measurement of the time 
between transmitting and receiving a pulse. Continuous lasers may also be used 
in which case a light beam from a high-frequency, amplitude-modulated, laser 
scans the 3-D structure, for example, by means of a scanning mirror (Figure 
7.14). The reflected light in a direction approximately opposite to the incoming 
light reflects from another mirror surface and is sensed with a photomultiplier, 
thus producing a high-frequency signal. The amplitude depends upon the diffuse 
surface reflectance of the points of the structure. Phase is related to the total 
light path, so is dependent upon the distance of the mirror to the points of the 
structure and provides range data (Duda and Nitzan, 1976). 

A common point for all these systems is that no shadow effects are measured. 
Preprocessing of the data to reduce noise influences is necessary but the 
identification of edges and surfaces is simpler than with a homogeneous 
illumination. Data about accuracy and other metrological parameters are 
seldom given in the literature; this is not important if recognition of structures 
composed out of (simple) surfaces is the main purpose. 
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Figure 7 14 Block diagram of a range-fioding sjstem 
using an amplitude«modu!a(ed laser 


7J.7 Derices for Detemiintng [eternal Structures 

The optical devices discussed m the preceding sections mostly produce data 
concerning the surfaces or objects, and consequently about the outside situation 
of objects (optically transparent 3-D objects are an exception) To obtain data 
concerning the inside situation of closed opaque objects a radiation source lilce 
a nucleonic, X-ra> or ultrasonic source may be used By absorption or reBec 
tion of this radiation in the speamen an image may arise representing useful 
information about the inside structure This image has to be sensed in an 
appropnate manner Sometimes the sensors immediately produce data that 
can be digitized and used as input data to an image processor and recognition 
system (for instance, ultrasonic data), m other cases an intermediate step has 
to be used (for instance, a photonegative produced by X-rays, that has to be 
handled by one of the sensors desenbed in the preceding sections) This section 
bnefly surveys some interesting systems used to produce images of the internal 
structure of objects or specimens 

7 3 71 Radiograph) 

A radiographic system consists of a nucleonic or X-ray source, an object with 
specific absorption for the structures to be found and a recording medium. 
When no speafic absorption is present onginally taking in of absorbing or 
nucleonic matenal may be a solution as is often practised m medical applica- 
tions In the case of nucleonic take>m material the position of this matenal 
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inside the body and not the absorption properties of the structures is relevant 
and has to be determined. A general difficulty in this field is that the images 
obtained are often of poor quality, noisy, distorted and ill-resolved. Visual 
inspection is sometimes possible by trained observers, but here too the situa- 
tion is not ideal. The automatic analysis of the images to obtain numerical 
data or to recognize the shapes of structures is mostly impossible without special 
measures. Some of the causes for the bad quality of the images are the finite 
source size, the finite duration of flash time of the X-ray source, no lenses are 
available for most cases, the point-spread function of the whole system is far 
from negligible and is space variant, movements inside the specimen or of the 
specimen as a whole may be present (for example, the dynamic action of the 
heart and veins, respiration). Additionally one has to deal with film response 
characteristics, noise, and background effects. Here the possibilities of image 
restoration and enhancement (Sections 7.5 and 7.6) may help greatly to produce 
better images, or indeed to make measurements possible. Interesting applica- 
tions can be found in Hunt et al. (1973). 

Total absorption depends on the absorption along the whole path of the 
rays. Projection in one direction only does not allow the reconstruction of the 
absorptivity as a function of position. Reconstruction is possible if many 
projections from different directions are available, though it needs considerable 
computer power. Many computing algorithms have been produced for this 
purpose— called tomography (Ter-Pogossian et al., 1977). Depth information 
may also be obtained by a stepwise moving pseudo-random coded aperture 
between the object and measuring positions (Koral et al, 1975). 


1.3. 1.2 Ultrasonics 

Ultrasonics, as well as nuclear magnetic resonance (NMR) (see the following 
section), can immediately provide information about single small-volume 
elements, in principle this being without need for reconstruction from pro- 
jections. In the medical field both are welcome low-hazard alternatives to 
nucleonic or X-ray sources. Ultrasound has also found numerous applications 
in material research (for instance, welding crack detection). 

The use of sound for imaging dates from submarine detection (SONAR) in 
World War II. A sound pulse from a transmitter is partly reflected at any change 
of acoustical impedance of the media passed. At the transmitter, now used as a 
receiver, different reflections arrive at different times. Their echo time is a 
measure of the distance along the direction of transmission. The sound pulses 
are repeated periodically. This sonic analog of RADAR can also make use of 
the latter’s display techniques by plotting echo intensity in brightness as a 
function of distance. The result is a one-dimensional section in the direction of 
transmission indicating changes in acoustical impedance. 
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A two-dimensional image can be generated via a slow, scanwise, mechanical 
variation of direction, this bemg applicable for objects that change or move at a 
still slower pace Fast electronic beam steenng through a phased array method 
was de\eloped m the wake of RADAR (Somer, 1968, Thurstone and Ramm, 
1974) A multiple set of single transmitter-receivers, each working on its own 
for Its own tune interval, has been devised as an alternative fast 2 D technique 
(Bom et al, 1973) The movement of heart structures (Figure 715), foetal 
breathing and the like can be studied in this way 

Discnmination is of the order of 1 mm at a frequency of 2 5 MHz given 
favourable conditions The images obtained are often of too low a quality to 
provide automatic recognition of for example, heart structures and to provide 
the determination of quantitative data about heart volume changes or the 
velocity of heart valves in an easy and reliable way This is due to shieldmg of 
ultrasound by large acoustic impedance objects (such as the bone of the nbs), 
small differences m acoustical impedance between blood and tissue, the lack of 
reflection from surfaces not perpendicular to the direction of transmission, and 
noise Image restoration and enhancement techniques, the use of tune informa- 
tion from successiv e images and interactive help from professional cardiologists, 
however, create more and more successful applications (Ckimputcrs m 
Cardiology, commenang 1975, available from IEEE Computer Society) 

A broad review on acoustic imaging is available m Proc IEEE, April 1979 

7 313 Zeugmatography 

Zeugmatography, spin imaging or spin mapping is a very new application of 
NMR (Lauterbur, 1973, Hmshaw, 1974, Andrew et al, 1978) It detects the 
presence of hydrogen nuclei (protons) such as exist in water, fats, and carbo- 
hydrates, but may be tuned to any other type of nucleus that cames magnetic 
spin When a sample is placed in a static magnetic field each of its protons may 
absorb or emit radiofrequency electromagnetic radiation of a well defined 
frequency determmed by the local value of the field induction The gross effect 
in normal circumstances is absorption By special techniques (adding slant- 
shaped constant or varying fields) the induction can be given a unique value at 
a given point (and in a restricted volume around it) The absorption at the 
correspondmg unique frequency then measures the proton density in the v olume 
selected, so producing imaging 

Due to the different environments of protons m fats and water, respectively, 
their absorption charactenstics, m particular the frequency selectivity, are 
different Discrimination between malignant and normal tissue seems to be 
possible m some instances as their environments m relation to fat are different 
Whole body scanners require full body embraemg electromagnets 

Many diflerent techniques have been developed, or are under development, 
m order to provide faster results (pulsed instead of continuous waves), higher 
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Figure 7.15 Echocardiogram produced by a multi- 
scan system. The top trace is an electrocardio- 
gram: (a) original image; (b) some heart structures, 
indicated by lines, found after processing; (c) 
interpretation of parts (a), (b). From left to right are 
shown m succession: 1, the anterior chest and heart 
walls; 2, the right ventricular cavity; 3, the inter- 
ventricular septum; 4, the left ventricular cavity; 
5, the left ventricular posterior wall; and 6, the 
mitral valve leaflets 
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resolution (currently of the order of 0 5-10 mm depending on size and measuring 
circumstances), simpler hardware and software requirements and less distortion 
Some distortion can be removed by restoration techniques Static fields and 
rad^ofrequency waves of NMR are shielded by highly permeable materials and 
good conductors, respectively In medical applications only conduction need 
be considered At 10 MHz the attenuationisacceptable in a body-sized \oIume 

7.4 SPECIAL PROCESSORS 


7.4.1 Introduction 

In most image acquisition, storage, processing, and display systems, the pro- 
cessing part is the throughput limiting factor When conventional computers 
are used, the large number of picture elements (pixels) involved causes the total 
number of memory references, computations and thus, total computation time, 
to be extremely large Even in the case where a small number of relatively simple 
operations per point will suffice, processing by a conventional computer will 
be much slower than the acquisition rate Assuming a TV scanner is used as the 
mput device, even a high-speed conventional computer will perform only one 
single instruction in the time the scanner needs to deliver several pixels 
Although throughput is not the sole factor forjudging an image-processing 
system, in most cases achieving a reasonable throughput is one of the major 
problems Sometimes the need is to process moving images, for example at 
25 per second, which imposes very high demands on the throughput In other 
cases very large numbers of images have to be processed with not too much 
delay, even image processing for application in algorithm design or m research 
often has to be reasonably fast As these systems are mostly used interactively, 
the results of the basic operations should be visible within seconds, thereby 
allowing unconstrained human interaction 
A large number of papers have been published m the past twenty years that 
give designs of architectures better suited to number ‘crunching’ than the 
con\ entional V on Neumann machines, but only lately has the price-performance 
ratio of the components been low enough to realize such machines Flynn (1972) 
gives an attractive review of the faster architectures for general-purpose com 
puters This chapter partly develops the same Ime of thought, but dwells more 
on the specific problems of two-dimensional information processing 

7.4.2 Alternatives for Obtaining Higher Processing Speeds 
Softnare s}stems 

Some systems use no special processing hardware All processing is done by a 
general-purpose computer using software packages written m assembly and/or 
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higher-level languages (Haralick and Currier, 1977; Johnson, 1970). Other 
systems use microcoded software packages (firmware), achieving a speed-up 
of 5-10 when compared to an all-assembler software package (Ito et al, 1978). 
The primary advantage of these systems, their generality, is at the same time 
the most important reason for their lack of speed: most applications of general- 
purpose computers do not impose high demands on throughput, and special 
characteristics of two-dimensional processing have not been used in the design. 


Faster technologies 

The most obvious way of improving performance is, of course, to speed up the 
existing parts of hardware by using the fastest available electronic components. 
The physical constraint on this type of speed improvement (from mechanical 
switches to emitter-coupled logic gives a factor of 10®) is, however, in sight. 


Hardwiring 

Other ways to obtain significant speed increases are hardwiring fundamental 
operations, for instance, adding a hardware multiplier; incorporating special 
floating-point logic; using faster and larger semiconductor memories to store 
image data, thereby minimizing swapping of data to and from disk storage 
during calculations. In many applications these are indeed found to be rather 
inexpensive ways of improving performance without affecting the (conventional) 
system architecture. However, by bringing more parallelism into the system 
architecture, by introducing a slave processor that meets the special demands of 
the most frequently used image processing operations, or by linking two or more 
computers (or microprocessors) for simultaneous operation on different parts 
of the same problem, it is possible to reduce the required computation times 
drastically. 


Parallelism 

Parallelism can be introduced at a number of different levels. Depending on the 
architecture, different jobs (tasks), independent parts of jobs, independent 
programs, independent subroutines, and even independent instructions or parts 
of instructions can be executed concurrently. An important problem is, however, 
the detection of such independent parts of programs or of independent instruc- 
tions. Programs and programming languages have been tuned for execution on 
sequential machines. This is why some experts (see for instance Amdahl, 1967), 
believe that parallel machines have only a limited field of application. If that is 
still true, the field of pattern recognition and image processing is in any case very 
well suited to the application of parallel architectures. Most of the popular 
image-processing operations make extensive use of array operations. In the 
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calculation of different pixels the operations may differ, but the operators will 
be the same all through the image In other words most image-processing 
opierations are parallel operations The different image operations of an 
algorithm are, of course, still performed in senes 

7.43 Parallel Architectures 

The parallel computer architectures that ha\e come of age are processor arrays, 
pipelined processors associative processors, and multiprocessors 


Processor array s 

Processor arrays consist of a number of connected processing elements (PE’s) 
with more or less limited possibilities of communication An important charac- 
teristic of processor arrays is that control is mainly centralized in one single 
array control unit The most popular interconnection patterns are given in 
Figure 7 16 The more complicated schemes (Figures 7 16b, c, and d), showing 
4-connected, hexagonally connected and 8-connected interconnection patterns 
arc the most suitable for image processing 
Ultimately one would like to use one PE per pixel, but because of size and 
economic restraints to date most realizations have been limited to a smaller 
number of PE’s whilst larger image arrays are processed by letting the smaller 
subarray scan the larger (for example 512 x 512) image For the same reasons 
mostly simple, often only one bit w ide, processors are used Arithmetic computa- 
tions are performed bit-senal, but image-parallel Examples are 3 x 3 PPM 
(Kruse, 1973), 8 x 8 llliac IV (Barnes et a!, 1968), 12 x 16 Clip 3 (Duffef al, 
1974), and the, now recently (1979) realized, 96 x 96 Clip 4 (Duff, 1976) 
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Figure 716 Inierconnection patterns of processing 
elements (a) no direct interconnection (b) 4-con- 
nected interconnection pattern (c) hexagonallj con- 
nected inierconnection pattern (d) 8<onnected 
intenronneclion pattern 
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Sometimes the term array processor is used for all machines that are merely 
better suited to perform array operations than conventional computers. 


Pipelined processors 

Pipelining techniques can be applied to use the different parts of hardware more 
optimally. In its simplest form, the fetch of the next instruction can be made to 
overlap the execution of the current instruction. More generally, when pipeline 
techniques are used (the name assembly line techniques would also have been 
appropriate), different parts of the computation are performed currently, but 
different datawords are handled serially. In evaluating, say, an inner product 
of two vectors X(i) and 7(i), as is necessary in the calculation of distances, 
convolutions, and correlations, a pipelined processor (in this example consisting 
of both a multiplier and an adder) concurrently multiplies two operands x(/c) 
and y(k), adds the previous product x(k — 1) ■ y(k — 1) to the previous partial 
sum X?=i^(0'y(0 and at the same time fetches new operands x(k + 1), 
y{k + 1) from memory. Not suffering from size constraints, in pipelined pro- 
cessors one can afford to use sophisticated, very high-speed, logic. Floating- 
Point Systems Inc. AP-120B is an example of a pipelined processor (the manu- 
facturer uses the name array processor that is meant to be used as a mini- 
computer’s slave processor). Further examples can be found in Ledley et al. 
(1978), Haralick and Minden (1978), Lemkin et al. (1974), Asada et al. (1978), 
and Gerritsen et al. (1977). 


Associative processors 

Associative processors are a variation on processor arrays. The PE’s are not 
directly addressed, but are activated if a program-specified match exists between 
a specified number and characteristic data inside the PE. Only the activated PE’s 
perform the current instruction, all the others remain idle (see also Rama- 
moorthy et al., 1978; Pao and Schulz, 1978). 


Instruction streams and data streams 

The three parallel architectures discussed have in common that the parallel 
units are controlled by one single control unit, but perform their operation on 
more than one data stream. This organization is generally referred to as single- 
instruction stream, multiple-data stream (SIMD). From this macroscopic point 
of view most conventional computers have the single-instruction stream, 
single-data stream organization (SISD), while the multiprocessor architecture 
to be discussed in the next paragraph will be classified as multiple-instruction 
stream, multiple-data stream (MIMD). 
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Multiprocessors 

Multiple-instruction stream organizations can be split up into real multi- 
processors, that IS configurations in which a number of complete and indepen- 
dent smgle-mstruction processors on a certain level share memones to perform 
cooperatively a number of rather independent tasks and secondly the multi- 
processor configurations in which so called skeleton processors share a larger 
number of system resources In both cases each of the separate processors has 
Its own program, but on a higher level the configuration is controlled and the 
processors are synchronized by an integral operating system The shared 
memories function as mailboxes for intertask and interprocessor communi- 
cation (Flynn, 1972, Boxer and Batchelor, 1978, Tojo and Uchida, 1978, 
Okada et al, 1976) 


Dra'tsbacks 

All architectures discussed suffer from unwanted effects that hamper the speed- 
up expected In the cases of processor arrays and associative processors the 
length of the vector to be processed (logic size) has to be fitted to the size of the 
array of PE’s (physical size) If the image to be processed is larger than the array 
of processors available, the scanning of the logic array by the physical array 
introduces considerable o\erhead When the size of the \ector to be processed 
IS smaller than the size of the processor array (the worst case being the processing 
of scalar quantities) a number of processors will not be used Similar vector 
fitting problems also arise when using pipelined processors Branches and 
decisions based on calculations just performed also influence the performance 
of SIMD architectures For instance, in the case of pipelined machines the current 
vector operation has to be completed (the pipe must be emptied) before the 
test can be performed and the next vector operations started 
In MIMD systems the most important problem is the overhead caused by 
intertask communication If the separate processors each process a separate part 
of the data of a larger, common problem, then inevitably delays will arise when 
one processor is waiting for another’s results, especially for variable execution 
times per task 


7.5 RESTORATION 


7.5.1 Introduction 

The image used in a pattern recognition system may be degraded The process 
of correcting for this degradation is called restoration If the degradation 
involves only the grey value of a point (pixel) it is called point degradation If it 
also involves the neighbourhood of a point, it is called spatial degradation Some 
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books on image processing, including restoration, are Huang (1975), Rosenfeld 
and Kak (1976), Andrews and Hunt (1977), and Pratt (1978). 

Distortions can be introduced by the sensor of the system (for example, by 
the optical transfer function), by movement of the source, or by atmospheric 
turbulence. Degradation is also produced by all types of noise. The simplest 
case occurs when this noise is additive. When the noise depends on the grey 
values in a known manner a (non-linear) rescaling of the grey values may 
compensate for this. Taking the logarithm of the grey value in the case of 
multiplicative noise results in additive noise. 

Reduction of noise is sometimes as important as the correction for distortions. 


7.5.2 Point-spread Function 

When the degrading system is linear, the blurred image g{x, y), can be given by 
a superposition integral 

g(x, 3') = JJ 

where /(^, tj) is the original image and h(x, y, tj) is the so called point-spread 
function (PSF). 

When the original image is a point source,/(<^, g) is a delta function and the 
resulting image g{x, y) equals the point-spread function. When, at a shift of the 
source, the resulting image g(x, jO is translated but otherwise unaltered, then 

hix, y, g) = h{x - Ly - g) 02) 

and the point-spread function is said to be shift invariant (SIPSF); otherwise the 
point-spread function is called shift variant (SVPSF). 

The superposition integral results in a convolution integral for a shift invariant 
point-spread function 

9ix, y) = JJ fO, g)h{x - ^,y~ g) d^ dg (7.3) 

Fourier transformation of both sides of equation (7.3) gives 

G(u, v) = H{u, v)F(u, v) (7.4) 

Capitalized characters denote Fourier transformed functions. 

Equations (7.3) and (7.4) provide the possibility of reconstructing the original 
image from the distorted one if the point-spread function or the transfer function 
are known (see next section). When the process of degradation is given, these 
functions can be calculated from theory, such as the point-spread function of a 
coherent or incoherent diffraction-limited ideal optical system (Goodman, 
1968), the point-spread function of a moving source (Aboutalib et al., 1977). 
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When the process of degradation is not known, the point-spread function has 
to be estimated from experiments Special sources can be used for this purpose 
such as point line or step sources (Rosenfeld and Kak, 1976, Huang et al, 
197J) 


7,53 Inierse Filtering 

When the degrading system is linear, the reconstruction can be obtained by 
inverse filtering When the degraded image is the result of a spatial invariant 
point spread function, F(ii, t) can be reconstructed according to equation (7 4) 
by multiplying G(ii u) with l/H(u t) In other words, the inverse filter is then 
simply i) Problems arise because H(u, u) will be zero at some points 

{u, t) (limited passband of physical systems) and so, in the absence of noise, 
G(u, t) IS also zero and the ratio G(ii, i)/H(u t) is not defined When H(u, v) would 
never be zero the inverse filler reconstructs the original image exactly in the 
absence of noise 

When additive noise is present equation (7 4) changes to be 

G(i(, t) = H{u, v)Fiu, V) + N{u, v) (7 5) 

in which N(u, i) is the noise term So dividing G(w, u) by H(h, u) gives 


Gju, t.) N(», v) 

H(u, i) H(u. i) 


( 76 ) 


For small values of H(u, t) at points (ii, u), Niu, u) can easily dominate 

F(i/, i) So a reconstruction filter A/(u, u) may only equal !///(», u) for those 
values of (», r) where the signal to noise ratio is sufficiently high, and for the 
other values a choice has to be made (for instance M(u, p) = 1) (Veen et al, 
1978) 

Another approach (which includes SVPSF) makes use of a one-dimensional 
representation f of the digital image f is a vector of length n* when n is the size 
of the square image The blurred image g is then given by 

g = [/i]r + n (7 7) 


in which n is the noise term The reconstructed image is obtained by calculation 
of the pseudo-inverse of the matrix h, or by iterative filtering (Huang, 1975) 


73.4 Linear Least- squares Filtering (Wiener Filtering) 

An optimal reconstruction process can be obtained by minimizing the mean 
squared error between the original image and the reconstructed image When it 
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is assumed that the reconstruction filter is linear shift invariant the reconstruc- 
tion /(x, 3’) is given by 


fix, y) = 'n(^ - C. 3’ - 


I)) d^ d)/ 


(7.8) 


i„ which m(x h) is the reconstruction filter and git, '0 is the dceraded 

image given by 

,,) = j h(^ - r, »? - s)f(r, s) dr ds + n(^, i]) (7-9) 


where ii(^, i/) is the noise term. 

Now £ = (/ - /)^ is given by 


E = ^/(x, jO - j i"(x - 3’ - '/)9(^> ’0 dC 


(7.10) 


and has to be niinimized for the whok ' 

reconstruction filter M(u, v) given by Rosenfeld and Kak (1976) 

M(u, v) = Sfg(tl, V)/Sgg(ll, V) ^ ^ 

Where S„(.,, c) is the epeetta. densrh, of the degrad« 

cross-spectral density of the degrade c eauation (7.1 1) becomes 

and the images are uncorrelated and « has ze > 

IH(», t;)!^ (7.12) 


= WJ) \Hiu, u)P + [S„„(n, vySffiu, i^)] 

where S.. is the spectral density of the noise and S„ is the 
image. The same result is obtained when/has a zero ”7" 

grey value tor physical systems IS positive a su^btraetion of 

then necessary. When no noise is present the & 

M(ii,v)= l/H(u,v). 193 it is necessary, for Wiener filtering. 

As appears from equations (7. 11) and (7. 1_) It IS 3 i-nr^wn nnt least 

that the spectral densities of both the noise and the image hh^n 

that cood estimates arc available. A further discussion or (1975) 

given by Pratt (1972), Pratt and Davarian (1977), and Ahmed and Rao (1975). 
The case where /i(x, y) is stochastic is described by epian 

7.5,5 Discussion 

Methods like inverse filtering and Wiener filtering can be 
Fourier transformations (DFT) of the degraded image, mu ip i 
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Fourier domain and again a DFT to the spatial domain Direct deconvolution 
m the spatial domain is also possible (with regard to computation time) when 
only a limited number of neighbours of a point is taken into account in the 
deconvolution (windowed filters) Special hardware to achieve convolution is 
mentioned m Section 74 

The digital Fourier transform will generally be computed by the fast Fourier 
transform (FFT) (Cochran et al, 1967) 

Besides the methods previously mentioned an abundant literature of re- 
construction filters exists (Huang, 1975, Andrews and Hunt, 1977, Rosenfeld 
and Kak, 1976) An example is a recursive method in which the dependence of 
the image points is modelled by a Markov mesh (Kalman filtering) (Nahi and 
Assefi, 1972, Habibi, 1972, Jam, 1977. Rosenfeld and Kak, 1976) 

A general problem of reconstruction filters is the assumption that the statistics 
(or spectral densities) of image and noise have to be constant over the whole 
image This is not true in general For instance, images used m pattern recogni- 
tion may consist of objects and background with different statistics, separated 
by edges These edges are important for recognition The strate©' adopted 
should, for instance, first subdivide the image into its constituent parts before 
filtering, this subdivision, however, depends upon the degraded image (Nahi and 
Habibi, 1975. Pratt, 1978) 

Another problem lies in the least-square error criterion Although this is a 
con\eniently objective criterion, an image reconstructed by it may not bcjudged 
by a human observer as optimal A human observer usually favours a noisy 
image with sharp edges more than a less noisy one with faint edges, as is produced 
by least-squared error filtering It is important to take into account models of 
the human visual system to obtain measures which agree with the opinion of the 
human observer (Stockham, 1972) 

7.6 ENHANCEMENT 


7.6.1 Introduction 

In the previous section restoration techniques were discussed that can be used 
to correct degradation present in the image When pattern recognition is the 
purpose of image processing certain degradations can, in fact, be very helpful 
For instance, the concept of different objects present m a background of an image 
is of primary importance m pattern recognition So an operation which en- 
hances the edges defining the objects is often necessary 
Grey-value rescaling is important not only because it may enhance the 
contrast in a particular grey-\ alue range, but also because it gives the possibility 
of obtaining equally probable grey values This is necessary in the computation 
of some texture parameters (Harahck et al , 1973) 
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A third kind of operation starts with so called segmented images, images in 
which, for instance, objects are separated from the background. Several tech- 
niques exist, for example, to eliminate noise and to close gaps. 

7.6.2 Grey-value Rescaling 

Rescaling of the grey values can be applied to enhance the contrast in the image 
or to compensate for grey-scale characteristics. Besides stretching and com- 
pression the grey values can be requantized to obtain a flat histogram of re- 
quantized grey values. When, after requantization, K grey values must be equally 
probable, the cumulative histogram of the (old) grey values must be divided 
into K equal parts. However, because the cumulative histogram consists of 
discrete steps (the grey values are discrete), requantization results in not exactly 
equal quantization steps in the cumulative histogram. Several methods exist to 
reduce this requantization error (Troy et ai, 1973). 

7.6.3 Spatial Grey- value Operations 

Certain spatial frequencies may be important in the recognition of objects; 
these spatial frequencies can be enhanced by filtering. For instance, chromo- 
somes can be recognized from their banding pattern. The spatial frequencies of 
the bands are known and can be used in a filter to enhance the banding pattern 
(Granum and Lundsteen, 1977). Low-pass filtering can be applied to reduce 
noise. Band-pass filtering and high-pass filtering can be applied to enhance 
edges. As with use of restoration filters, filtering can also be achieved by using 
Fourier transformation or by direct convolution with (windowed) filters in the 
spatial domain. 

There is also a considerable interest in non-linear filter methods. These give 
the possibility of suppressing noise while still preserving sharp edges. Some 
examples are median filters (Huang et ai, 1978), where the median of the grey 
values of a window around a point is taken as the new grey value of that point 
(Figure 7.17) and the edge-preserving filter (Nagao and Matsuyama, 1978). In 
this latter filter, for each point the variance of the grey value is computed in 
differently oriented windows and the new grey value is the mean grey value of 
the direction with the lowest variance. Another variance filter is given by 
Kuwahara et ai (1976). 

Special edge-locating filters are developed by Hueckel (1971, 1973) and 
Persoon (1976). Objects with a fixed shape or a fixed grey-value pattern can be 
located by matched filters (Rosenfeld and Kak, 1976). Straight lines in an image 
can be found by the Hough transform (Duda and Hart, 1972; lannino and 
Shapiro, 1978). The Hough transform can be extended to certain other param- 
etrized curves (Bazin and Benoit, 1965; Wechsler and Sklansky, 1977; Shapiro, 
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7 6 4 Segmented Image Operations 

A segmented image is an image which is divided into its different constituent 
image parts An image can, for instance, be divided into objects and background 
In this case the segmented image is a two-valued (binary) image 1 for points 
belonging to an object and 0 for points belonging to the background Segmenta- 
tion of an image can be obtained by thresholding the grey value (Rosenfeld and 
Kak, 1976) by edge detection (Hueckel, 197l,Persoon, 1976) or by region grow 
ing (Brice and Fennema 1970, Zucker, 1976) 



Figure 7 17 Example of median filtering (a) an image de 
graded by shot noise Ten per cent of the pixels contain the 
maximum intensity value (b) Image after median filtering (with 
a 3 X 3 neighbourhood) 




Figure 7.17 
noise pixels 


-) Image when it is filtered a second time. All 
ive now disappeared but the edges are fainter 


Objects smaller than a certain size can 

all object points which have neighbours be ong g determined by 

assigned to the background. The size of 1 S. Small 

the number of times the erosion was appHed be o e the o^ect van ^ 

bridges between objects will also disappear with this operation, B 

X'e reverse operation is called dto, on and is 

rgSToTat oXt.S™tX"iroX.a are hlled in w 

"rap^XIittion „ times after „ X'-'X^XSts^ SXd 

times erosion after n times dilation (called c/osnig), pans') but now the 

as mentioned above (elimination of small , a further discussion 

remaining objects again have about their original size. Fm a 
see Herstnt « »l. (1976). Another imponant the 

(Hilditch, 1969; Stefanelh and Rosenfeld, 1971). ^nis is o. , ^ ^j^ation 

restriction that the connectedness ofthe objects does no c a g • oj-iginal image, 

results in one point thick lines, with the same connectivity as t g S 

Figute 7.19. 5oisy contours of objects may lead « “—feld and Davis, 
skeleton. Several methods exist to reduce this e ( vertices they 

1976). Skeletons are important because, by their end points and vertices, y 

easily reveal how an object is connected. ..vtpnded to three 

Erosion, dilation and skeletonization techniques can 
dimensions (Lobregt et al., 1979). 







Figure 7.18 Example of dilation; (c) ‘=’^°"^°^°^®^®osome7are 
plementary dilation applied. All gaps ® 

filled in (white pixels of (d)) 
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Figure 7 19 Example of skeletonization Skeletons 
of the chromosomes of (a) are shown in (b) 


7.7 TRENDS 

It IS expected that the fields of pattern recognition and measurement will grow 

closer together for the following reasons 

(a) Pattern recognition often requires physical measurements m order to 
gather data from the sources with patterns as input signals for the recog- 
nition system In choosing special sensors one may sometimes avoid part of 
the preprocessing of the input data 

(b) The ultimate aim of measuring is often a classification or decision based 
upon the input data, for example, to distinguish between certain situations 
of the sources or to draw certain conclusions about them given the shape 
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of measured curves or a certain relation between input data. It is, therefore, 
quite natural that a pattern recognition procedure follows the measuring 
stage. Human creativity will mostly be necessary to interpret measuring 
results in the context of research. But in more standardized conditions, for 
instance during production, control of quality or of information content, 
an automatic classification will be appropriate. 

(c) Advancing technology opens up possibilities for interactive design of 
processing methods for both measuring and recognition purposes. More 
complex measuring devices (such as echocardiographs) create a need for 
pattern recognition. Besides, miniaturized pattern recognition systems will 
become available to combine with measuring apparatus in order to 
provide data reduction, for example, for remote sensing and meteorology. 

Real applications of pattern recognition systems are gradually gathering 
momentum, but progress is a little slower than was predicted one or two decades 
ago. A few reasons for this latter statement are: 

(a) The choice of good, relevant features turned out to be very difficult in 
fields like character recognition, speech analysis, remote sensing, and 
biomedical engineering. Very many different choices are possible and are 
made in practice. Useful knowledge about human visual or auditive 
activities is only available in a very limited way and has not lead very often 
to the choice of appropriate features for automatic systems. 

(b) After choosing features, much knowledge has then to be collected about 
their statistical or linguistical properties in order to be able to successfully 
apply one of the great variety of decision algorithms. Extensive and repre- 
sentative learning sets or data banks are necessary for this purpose (and 
also to compare results with other research workers) but they are difficult 
to obtain. So the quality and relevance of the features chosen cannot be 
tested easily; which results in a long-continued use of inappropriate 
features. 

(c) Many practical problems involve images and the processing of images, 
and consequently the use of very many bits. Conventional computers are 
not well suited to this job and are too slow when on-line processing is 

fdl recognition and industrial control, 

te reliability of pattern recognition systems, the flexibility to adapt to 
problems a little different from the original ones, and the level of main- 
let "'^re often too low for practical use. 

c cost of pattern recognition systems was often too high to compete with 
luman beings, who are very experienced in recognizing patterns and who 
(f) flexible, even with limited education. 

1C introduction of pattern recognition systems, with all the consequences 
fr the people involved, has not always been done in a sufficiently careful 
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The preceding enumeration indicates some difficulties but also suggests t\hy 
It ma\ be expected that automatic pattern recognition will gradually be applied 
to more tasks 

(a) Much research has been done m the last decade and is still being earned 
out More knowledge about relevant features for certain fields of appb 
cations is becoming asailable gradually 

(b) Extensa e data bases are coming mto existence, for example, those con 
cemmg alphanumeric characters, speech, and chromosomes Though the 
situation IS unsatisfactory m many fields, this problem is recognized and 
progress is bemg made It will become gradually less difficult to obtain 
learning sets in a number of fields 

(c) A great number of special data processing systems and components are 
becoming a\ailable that use microprocessors, pipeline architecture and 
some kmd of parallel processing of data These systems attain much higher 
speed and allon on line processing 

(d) These hardware de\ elopments may also enhance the reliability of recogni- 
tion systems and modular designs will enable maintenance in an easy nay 

(e) As the price of digital LSI components is reduced, along nith an increase 
in capabilities, potential application of ntw systems will improve Increas 
mg salary costs and the unwillingness of people to perform routme to 
spection tasks will also pro\ide better prospects for automatic recognition 
systems 

(f) As It IS well known now that the introduction of automatic systems needs 
\ery careful preparation and introduction, jt may be expected that less 
failures will occur m the future 

A special point may be mentioned separately It became clear m the past 
penod what the difficult problems were and which were solvable Knowing that 
reading continuous wnting and unrestricted handprinted characters is difficult, 
one can pay attention to what type of rcstnctions m wntmg will be useful and 
acceptable to make automatic systems a success (Suen et al^ 1978) The idea is 
that people will be motivated to adapt to these restnctions if this facilitates the 
speed of banking operations or post handling Speech understandmg of con 
nected words, an extensive vocabulary, and many -speaker situations is another 
difficult area. This may be avoided by streamlining a dialogue with a machine 
which only asks specialized questions with only a few possible answers and by 
repeating the question w hen the system did not understand the answ er correctly 
Such question-answering systems can be used for special tasks bke seat reserva 
tions in trams (Shikano and Kohda, 1978) 

A last optimistic aspect is the intensive cross-fertilization between methods 
and experiences from several subfields of pattern recognition and image pro- 
cessing Combined use of statistical, fuzzy, and hnguistical features and methods 
IS becoming popular Experience m one of the fields of biomedical engineenng. 
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character recognition, speech recognition, remote sensing or industrial applica- 
tions may be useful also in other fields. 

To summarize it can be expected that the field of pattern recognition in 
combination with the measuring field and image processing will gain momentum 
for practical applications in the future. 
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Chapter 



A. VAN DEN BOS 


Parameter Estimation 


Editorial introduction 

Given a measurement situation where it is known that a signal of a certain mathematical 
form exists buried in random noise, how can an estimate of the parameters, the coefficients, 
of the mathematical model be realized?This chapter provides a review of the mathematical 
philosophy and procedures with which such parameters can be estimated through applica- 
tion of methods of curve fitting to systematic data perturbed by stochastic processes. The 
mathematical expressions so realized do not necessarily describe the true internal physical 
behaviour of the process; that is, the estimated coefficients are not necessarily identifiable 
as real physical parameters. The output of the model so produced will, however, adequately 
describe the performance sought. 

The subject matter of parameter estimation rests heavily on advanced mathematical 
material that has come into extended application over the past two decades. This came 
about because of the comparative ease with which the complex and lengthy mathematical 
operations needed can now be handled by digital machine computation. In order to 
provide a concise review, the author has needed to assume that the reader is familiar with 
determinant algebra, probability theory, discrete function mathematics, and spectral 
analysis in general. In addition to the references cited in the text readers will find relevant 
material in Griffiths ef al., ( 1 973), Schwartz and Shaw (1975), Van Trees ( 1 968, 1 97 la, 197 1 b). 


8.1 INTRODUCTION 

This chapter discusses application of statistical parameter estimation techniques 
to physical observations. A common characteristic of these techniques is that 
use is made of a more or less detailed parametric mathematical statistical model 
of the observations. The quantities to be measured are the parameters of this 
model. 

The choice of model for the observations is closely connected to the objectives 
of the parameter estimation procedure. If the purpose is measurement of 
physical parameters the model of the observations arises from an, often 
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detailed, physical analysis and is a careful description of the physical process 
that IS supposed to generate the observations The parameters then have a well 
defined physical meaning An example is the model of the observations used m 
radioactive decay measurements This is a weighted sum of exponentials The 
weights and the decay constants are the parameters to be measured The decay 
constants are specific for a particular radioactive component, while the weights 
are measures of component concentration Models of observations used for 
measurement of physical parameters must beset up with utmost care The reason 
is that systematic deviations of the observations from the model generally lead 
to systematic deviations of the measured parameters from their true values 
An example of a systematic model error is using a bi exponential radioactive 
decay model for observations which are, contrary to this assumption, tn 
exponential This example illustrates that models of observations used for 
estimation of physical parameters must be complete in the sense that they must 
mclude all major systematic contributions As a result the establishment 
of a sufficiently complete parametnc physical model often requires a substantial 
expert knowledge and is a rule mere demanding than the selection and imple- 
mentation of a suitable parameter estimation procedure A much simpler class 
of parametric models may be used if the purpose is accurate quantitatite 
description of the observations as a function of a, usually relatively precisely 
known, independent variable These models are referred to as curve fitting 
models They are often only loosely connected to the actual physical process 
generating the observations Consequently it is not unusual if their parameters 
have no clear physical meaning Well known examples are calibration curves 
obtained by fitting a linear or higher degree relationship to the observations 
Further important examples are the modem dynamic parametric models for 
stochastic processes discussed in Section 8 4 Here the stochastic process is 
modelled as white noise that has passed a dynamic system described by a linear 
difference equation The purpose of this model is not to explain the mechanism 
that generated the stochastic process The purpose is a quantitative description 
of the spectral properties of the stochastic process in the form of a relatively 
^maJU jjumbfj- of dAffwaw ctfeMo.'SJiJs Th:s xosy be for 

automatic control purposes or simply from data reduction point of view Models 
used in practice may also be mixtures of curve fitting models and physical 
models For example, in X-ray spectroscopy the measured spectrum is often 
modelled as a sum of gaussian peaks of unknown height, width and location 
These parameters have a clear physical meaning On the other hand, the un 
desirable background radiation in the measurements is usually modelled as a 
polynomial, which is a curve fitting model Also, m experimental physics, 
physical models are used which contain a number of deliberate simplifications 
and approximations For example, m dynamic modelling, models resulting from 
conservation laws may be complicated and nonlinear They may, as a result, be 
very difficult to verify experimentally Quantitative analysis and linearization 
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may then result in a simplified model which is suitable for experimental valida- 
tion. 

It is important to realize that many parameter measurement methods used in 
experimental physics have been developed in an era when digital data acquisi- 
tion and processing were non-existent or too expensive. As a result, in these 
conventional methods emphasis is on computational simplicity. Needless to 
say that this emphasis leaves little room for other considerations, for example, 
precision or numerical properties other than simplicity. A further striking 
characteristic of many of these conventional methods is that the observations are 
considered exact. The model of the observations does not include a model of 
systematic and/or non-systematic errors. Illustrative examples of conventional 
methods are found among graphical techniques, for instance, determination of 
time constants of linear dynamic systems using the asymptotes in the Bode 
diagram. Further examples of conventional methods are discussed in (Van den 
Bos, 1977). A disadvantage of conventional methods is that systematic and non- 
systematic errors in the resulting parameter measurements are difficult or 
virtually impossible to compute. This complicates a comparison of different 
methods both with respect to accuracy and precision. Moreover, these methods 
are often subjective. There is no clear-cut, objective procedure according to 
which the experimenter measures the parameter. Furthermore, the absence of 
errors in the model of the observations precludes the use of a priori knowledge 
about the errors for improving the precision with which the desired parameters 
are measured. Of course, these critical comments on conventional procedures 
are less relevant if only a rough estimate of a parameter is required. They should, 
however, be kept in mind whenever an efficient use of the available observations 
is to be made. 

Modern statistical parameter estimation methods require a model of the 
observations that includes a model of both the systematic errors and the non- 
systematic errors. An example of a model of systematic errors is the background 
model in the above X-ray spectroscopy example. Non-systematic errors are 
modelled as zero-mean stochastic variables or, if their chronological ordering 
is relevant, as stochastic processes. Thus a particular set of observations is 
considered to be a set of observations made on stochastic variables or to be a 
realization of a stochastic process. Then the parameters to be estimated are 
parameters of probability density function defining those stochastic variables 
or stochastic processes. Thus the measurement problem is reformulated as 
estimation of parameters of probability density functions from observations of 
the corresponding stochastic variables. It has, therefore, taken the form of a 
statistical parameter estimation problem. In statistical parameter estimation the 
function of the observations that is used to compute the parameter is referred 
to as estimator. Thus an estimator is a stochastic variable. The value produced 
by an estimator for a particular set of observations is called an estimate. An 
estimate is a number. 



334 


HANDBOOK OF MEASUREMENT SCIENCE 


The formulation of a measurement problem as a statistical parameter 
estimation problem implies that for its solution use can be made of the extensive 
collection of theories and methods available not only from pure mathematical 
statistics, but also from automatic control, econometrics, biology, and other 
fields An outstanding text on stochastic variables and processes is Papoulis 
(1965) Authoritative in the field of mathematical statistics are Cramer (1961), 
Kendall and Stuart (1966, 1967. 1969), while for an excellent introductory 
text the reader is referred to Mood etal (1974) Texts such as Eykhoff (1974) and 
Goodwin and Payne (1977) are mainly devoted to estimation of parameters of 
dynamical systems An extensive collection of papers on the same subject is 
found m IEEE (1974), IFAC, (1967, 1970, 1973, 1978, 1979) 

The statistical parameter estimation approach offers a number of advantages 
to the experimenter that are difficult to obtain if the conventional methods 
mentioned above are used In the first place the expectation and the variance of 
an estimator can be computed Then the bias, defined as the difference between 
this expectation and the true value of the parameter, represents the systematic 
error, that is, the accuracy of the estimator Bias, although systematic itself, may 
be a result of non-systematic errors in the observations Similarly, the variance, 
or rather its square root the standard deviation, is a measure of the non 
systematic error, that is, the precision of the estimator Bias and standard 
deviation are objective quantities suitable for comparison of different esti 
mators of a parameter applied to the same observations Simple examples 
of the computation of bias and standard deviation are discussed m Van den 
Bos (1977) Furthermore, statistically modelling the observations enables the 
expenmenter to find relatively precise or even the most precise estimator for a 
particular problem This is discussed in Sections 8 3 and 8 4, for static and 
dynamical models respectively Also, if the a prion knowledge about the non- 
systematic errors m the observations is sufficiently detailed, a lower bound on the 
vanance of any unbiased estimator of a particular parameter, for a given set of 
observations, can be computed This lower bound, the minimum variance 
bound, IS discussed in Section 8 2 It enables one to investigate the feasibility 
of a set of observations for the parameter estimation objectives concerned 
Finally, a statistical model of the observations may be used for minimizing 
or reduang the variance of an estimator through experimental design, that is, 
through manipulating independent vanables that can be freely chosen This is, 
very briefly, discussed in Sections 8 3 and 8 4 
This introduction is concluded by the observation that parameter estimation 
methods are a useful addition to, and no substitute for, classical techniques 
for reducing systematic and non systematic errors For example, it is much 
better to avoid systematic errors m the observations than to include them in the 
model of the observations m the form of a parametric model In the latter case 
the simultaneous estimation of the additional parameters increases the variances 
of the estimates of the remaining parameters, which were the primary objective 
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Similarly, reduction of non-systematic errors in the observations is worthwhile, 
since the variance of most parameter estimators is proportional to the variance 
of these errors. 


8.2 PRECISION 

In this section the minimum variance bound (MVB), also called Cramer-Rao 
lower bound, is discussed. The MVB is a lower bound on the variance of any 
unbiased estimator. Since it is independent of any particular method of estima- 
tion, it provides a bound on the precision that can be achieved given the observa- 
tions. The MVB is, therefore, a useful tool for investigating the feasibility of the 
observations for the parameter estimation objectives concerned, the more so as a 
class of estimators exists which, at least asymptotically, achieve this bound. 
These estimators are discussed in Section 8.3 of this chapter. In the following 
the underlined characters, e.g. Wj, are stochastic variables. Vectors or matrices 
are shown in bold face type, e.g. x. Before proceeding the notation that is used in 
what follows should be explained. 

Let X and y be X x 1 and L x 1 vectors respectively and let/(x) be a scalar 
function of the elements of x. Then; 

(a) the 1 X X vector df(x)ldx is defined by its m element df{x)ldx„; 

(b) the X X X matrix d'^f (x)/5x^ is defined by its m, n element d'^f (x)/dx„ dx„ ; 

(c) the L X X matrix dy/dx is defined by its m, n element dy„/dx„. 

Let X = (xi • • • Xfc)^. Then the X x X covariance matrix cov(x, is defined by 
its m, n element cov(x„, x„). 

The MVB may be described as follows. Let Wj, • . . , yv^v be the observations and 
define w as the column vector of these observations. Furthermore let the prob- 
ability density of w be/(o}; 0) where the elements of ca correspond with those of 
w and the elements of the X x 1 vector 0 are the unknown parameters. Suppose 
that the elements ri(w), . . . , rj(w) of the J x 1 vector r(w) are unbiased esti- 
mators of the functions Pi(0), . . . , Pj(0) and define p(0) as (pi(0) ■ • • Pj(0))’^. In 
addition, define the information matrix M by 

M = £[(5 In L/dQy{d In L/50)] 

where L = /(w; 0). Then under a number of, not too restrictive, conditions the 
Cramer-Rao inequality (Zacks, 1971) states that 

cov(r(w), r(\v)) ^ [dp(0)/d0]M"^[5p(0)/50]’^ 

expressing that the difference between the positive semi-definite left-hand and 
right-hand members is positive semi-definite. The right-hand member defines 
the minimum variance bound (MVB) or Cramer-Rao lower bound on the 
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covananceofany unbiased estimator of p(0) It can be shown that undercerlam 
rcgularitj conditions M may aUematisel} be written 

M = -£(d^ In L/cQ^) 

This form is often easier to compute With respect to the Cramer-Rao m 
equality the following remarks can be made 

(a) It can easily be shown that the variances of the elements of r, which are the 
diagonal elements of cox(f, r) cannot be smaller than the corresponding 
diagonal elements of the MVB 

(b) lfp(0) = (^i 0*)^ the MVB simplifies to M“ * 

(c) The /th diagonal element of the MVB, that is the MVB on the variance of 
r, equals 

idp,/c0)\\ '(dp,/c0)^ 

SinceM ' istheMVBforG itisconcludedthatthcMVBforafunctionofO 
follows a simple first-order error propagation law 

(d) Suppose t IS a biased estimator for 0 Let the bias be b(0) Then r is an 
unbiased estimator for 0 + b{0) Hence the MVB for t is 

{I + [cb{0)/c0]}M ‘U + [ab{0)/c0])T 
where I is the identity matrix 

(c) Suppose that of the parameter vector 0 = (Oj 0^ only one 

clement, , is unknown Then the MVB on the variance of an unbiased 
estimator r» oft?* equals l/£[(d In Next assume that all elements 

of 0 ha\e to be estimated and let the corresponding information matrix be 
M Then the k , k element of M is described by nik » = £[(5 In )^] 
It can be shown that the product of the k th diagonal element of a positwc 
definite matru and the corresjwndin^ diagonal clement of its inverse is 
larger than or equal to one Equality only occurs if the diagonal clement is 
the only non-zero element in its row and column It is concluded that the 
k,k clement ofM ‘ is larger than or equal to l/oikt So, as compared with 
the one parameter case, the MVB generally increases 

This section is concluded by three simple examples which illustrate a number 
of aspects of the above theory 

Example 8 I Esumatjon of ihe slope of a straight line Suppose that the 
parameter a is estimated from the observations (ji|, Xj), ,(h.v.JCv) described 
by the model 


li. = ax, + i. 
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where the v„ are independent and identically distributed (iid) and are normally 
distributed with expectation = 0 and standard deviation <j(N{ 0, a)). Then the 
joint probability density of Ui, . . . , % is described by 

/,(o) = exp^-iff-2 ^ ^2^ 

where u = (o^ • • • Hence the probability density of w^, . . . , vv^, is 

fyXo); a) = det(5«/5o))exp^— 50 -“^ ^ (co„ — 

where to = (coi and 5u/5o) is the Jacobian. Since (o„ = ax„ + v„, 

dv/d(o = I and hence det(dt)/5o)) = 1. Therefore 

In L = ~jN ln( 27 t) — IV In tr — icr~^ ^ (w„ — (xx„)^ 

n 

Then 


— E(d^ In Lida?) = a ^ ^ 

n 

and therefore the MVB for a is x^. 

Example 8.2 Estimation of the parameters of an exponential model. Let 
(vvj, Xj), . . . , (wjy, Xjv) be observations described by the model 

= E 4 exp(-/r^x„) + v„ 

k 

where 0 = (Aj • • • /ti • • • Pk)^ are the unknown parameters, the x„ are exact 
and the v„ are iid and N(Q, a). Then using the same argument as in Example 8.1 
one obtains 


In L = - iAT ln(27t) - N ]n a ~ Y. ^^(6) 

n 

where 

dM = w„ - E ^(c expi-pkX„) 

k 

Then partial differentiating, taking expectations, and taking into account that 
-E[ 4 ( 0 )] = 0 yields 

-E{d^ In L/dppdpg) = E exp[-(//p + /t,)x„] 


-E(d^ In LldXpdX^ = ^ E exp[-(Atp + /t,)x„] 

n 


- E{d^ In LldXp dpg) = -a ^ Y.XpX„e\pl-(,Pp + /i,)x J p,q= U...,K 
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These expressions define the information matrix M and therefore the MVB, 
M"' 


Example 8 3 Influence of the estimation of an additional parameter on the 
MVB Suppose that the parameter p has to be estimated from the observa 
tions (iVj, X,), , (wn, Xs) and that the model of the observations is described by 

w„ = exp(-^xj + 7 + 

where the x, are exact, y is an unknown offset and the are idd and N(0, a) 
Note that, smce y is unknown, both fl and y have to be estimated although y is 
not of particular interest 

Using the expressions of Example 8 2 one obtains for the elements of the infor- 
mation matrix of 0 = y)"^ 

-E(8^ In Udp^) = X exp(-2K) 

-£(5^ In ^x„ expC— 2^x„) 

-E{d^ lnydy^) = N/a^ 

If for example ^ = I, x, = 0 In and N - 75, these expressions yield 


,/ 2 500 

“9942\ 

. .. , ,/0 846 

01I2\ 

[-9942 

75 ) 

and 

0028/ 


The MVB for estimation of is therefore 0 S46o^ 

Next assume that, as a result of a different experimental set-up, no offset in the 
observations occurs Then the model of the observations is described by 

w„ = exp(-px„) + L„ 

So only p need be estimated Then the information matnx is scalar and is equal 
to the element mi i of the above M Hence in this case the MVB for estunation of 
P IS 1/mii = 04ff^ 

Discussion of the examples From its definition it follows that the MVB can 
only be computed if the probability density of the observations is known For 
mstance, m Example 8 1 use is made of the observations being independent 
and normally distributed with known variance The resultmg MVB is not a 
function of the unknown parameters and can, therefore, be computed before- 
hand for any set of values of the independent variables Xj, , x^ Furthermore, 
if, within physical restrictions, the independent variables may be freely chosen, 
they may be chosen so as to minimize the MVB For example, if the restriction 
is|x„| < 1, the MVB is minimized if onechooseslx„| = Iforalln In the litera- 
ture a particular choice of independent variables is referred to as experimental 
design For a general discussion of experimental design see Fedorov (1972) 
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In Example 8.2 the computation of the MVB is based on the same a priori 
knowledge about the observations as in Example 8.1. Unfortunately, as opposed 
to the MVB in Example 8.1, the MVB of Example 8.2 is a function of the 
unknown parameters and cannot, therefore, be computed beforehand. The 
absence of the unknown parameters in the MVB of Example 8.1 is a consequence 
of the fact that in this example the logarithm of the probability density is quad- 
ratic in the parameters. Example 8.1 is, therefore, an exceptional case. Neverthe- 
less, even if the MVB is a function of the unknown parameters, it remains an 
extremely useful tool. For nominal values of the unknown parameters it enables 
one to quantify variances that might be achieved, to detect possibly strong 
covariances between parameter estimates and to select suitable experimental 
designs. For instance, for the exponential model of Example 8.2 large variances 
and covariances may occur when the difference between the decay constants is 
small. In such cases the conclusion may be that a hypothetical estimator attain- 
ing the MVB would still be not sufficiently precise for the estimation objectives 
concerned. A solution may be to decrease the MVB by a different experimental 
design, that is, to select different values of the and N. If this is not possible the 
conclusion may be that the available observations are not suitable for the 
parameter estimation objectives concerned. With respect to the numerical 
computation of the MVB it should be noted that the information matrix may be 
ill-conditioned. Its inversion, needed to compute the cofresponding MVB, 
must therefore be carried out with care. 

Example 8.3 shows the increase of the MVB of the relevant parameter if, 
in addition, a further, possibly highly irrelevant, parameter is estimated. 
Therefore it is much more preferable to eliminate, if possible, the offset in the 
observations by a different physical experimental set-up than to include the 
offset in the model of the observations and estimate it along with the relevant 
parameters. This illustrates that parameter estimation is a complement to and 
no substitute for the conventional error reducing or eliminating techniques. For 
further, highly instructive, case studies of reducing the number of parameters 
in a practical physical experiment the interested reader is referred to Kaufmann 
and Akselsson (1975) and Van Espen et al (1977). 


8.3 PRECISE ESTIMATORS 


8.3,1 Maximum Likelihood Estimators 

Suppose that in a particular experiment the observations, considered as 
stochastic variables, are a: = (lEi • • • W;„)^ and define ^(<o; 0) as their proba- 
bility density, where the elements of <0 = (cui • • • correspond with those of 
w and 0 = (0j . . . 0^)'^ is the vector of unknown parameters to be estimated 
from a. Now let w = (wj • • • Wjy)^ be one particular realization of w, that is, the 
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elements of w are numbers, not variables Then for that particular realization 
the function L = /,(«,*) with t = (fi fx)'*’ is defined as the likelihood 
function of the parameters Thus L is a function of t Then the maximum likeli- 
hood estimate of the parameters t from w is defined as that value t of t that 
maximizes L For a discussion of the extensne and, unfortunately, relatively 
complicated theory of maximum likelihood estimation see Norden (1972, 
1973) and Zacks (1971) Here only a summary is given of a number of useful 
properties of maximum likelihood estimators The first three properties listed 
below are asymptotic, that is, they are defined in terms of hmits approached 
when the number of observations used goes to infinity In the order in which they 
are listed these asymptotic properties require an mcreasing number of con- 
ditions to be met A discussion of these conditions can be found m the literature 
cited abo\e 

(a) Under very general conditions maximum likelihood estimators are con 
sistent An estimator is defined as consistent for 0 if for any e > 0 
F(Us — 0| < fi) = 1 for N -» 00 where N is the number of observations 
and P(A) is the probability of the event A Note that consistency is a 
property of the area under the probability density on a certain interval. 
It IS not a property of the moments of the probability density A consistent 
estimator is, therefore, not always asymptotically unbiased nor has it 
always a finite asymptotic variance Furthermore, even if a consistent 
estimator is asymptotically unbiased, this is no guarantee for unbiasedness 
for small numbers of observations Many consistent maximum likelihood 
estimators are seriously biased for small numbers of observations 

(b) The asymptotic probability density function of a broad class of maximum 
likelihood estimators is normal with expectation 0 and covariance M"‘ 
where 0 are the true values of the parameters and M“ * is the MVB Note 
that this IS a property of the asymptotic probability density of Xjv, not of its 
moments 

(c) Under certain additional conditions the asymptotic covariance matrix 
of a maximum likelihood estimator equals the MVB 

(d) If i IS a maximum likelihood estimator for the x 1 parameter vector 
0, then g(t) IS a maximum likelihood estimator of the L x 1 vector g(0) 
of one-to-one functions of 0 where L ^ K This property is usually re- 
ferred to as tniariance property 

Example 8 4 Estimation of the slope of a linear relationship Consider the 
estimation of the parameter a of the model jy, = ax„ -t- discussed in Example 
8 I Suppose that the observations Hj, of vy„ are available Then the 

likelihood function of the parameter a is described by 

L = expf-k-" Y. K - 
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Since L and In L are monotonic, maximizing L and maximizing In L with respect 
to a are equivalent. Hence the maximum likelihood estimate a of a is that value 
of a that maximizes 

In L = —jN ln(2;t) — N In o- — Y, ~ 

n 

Equating the derivative with respect to a of this expression to zero yields 

~ _ YnW„X„ 

The maximum likelihood estimator is therefore 

- Yn )VnX„ 

' y 

/ . n 

Since in this expression the x„ are exact, 5 is a weighted sum of independent 
stochastic variables w„. Hence var a is the quadratically weighted sum of the 
variances of the vv„. These variances are all equal to Hence 

var a = 

Moreover, £(g) = a since E(]v„) = ax„. So a is unbiased. 

Example 8.5 Maximum likelihood estimation of the parameters of an 
exponential model. For the estimation problem of Example 8.2 the logarithm 
of the likelihood function is 

-^N ln( 27 i) - N In <T - 



where 


dnW = -Yh expC-nifeXj 

k 

where the elements of t = (1^ 1^ nij • • • correspond with those of 0. 

Then equating to zero the gradient with respect to t of In L yields the following 
equations to be satisfied by the maximum likelihood estimate t of 0; 

Y rfn(t)exp(-i7UX„)!,=T = 0 k = l,...,K 

n 

Y dn(^)lkX„ exp(-mfcX„)|t=-{ = 0 k = l,...,K 

k 

Discussion of the examples. The maximum likelihood estimator of Example 
8.4 has a number of favourable properties. In the first place, it is unbiased. 
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Furthermore, comparison of its variance with the MVB computed in Example 
8 1 shows that it achieves the MVB for any number of observations This is an 
illustration of a theorem stating that if there exists an unbiased estimator having 
the MVB as covariance, it is the maximum likelihood estimator Moreover, the 
maximum likelihood estunator of Example 8 4 is closed form and corresponds 
to a unique maximum of the likelihood function The estimator of Example 8 2, 
on the other hand, does not achieve the MVB for a finite number of observations, 
as can easily be shown Furthermore, it may seriously be biased if the number of 
observations is small For illustrative examples see Van den Bos (1979) Also 
the expressions for the estimator are implicit and non-linear and can, as a 
rule, only be solved numerically Examples of relevant numerical techniques 
are described m Section 834 TTie solution is further complicated by the fact 
that a likelihood function, as that in Example 8 5, may have more than one 
maximum From these the absolute maximum must be selected The maximum 
likelihood estimators of Examples 8 4 and 8 5 have in common that they are both 
equivalent to least squares estimators This important property will now be 
studied in a more general form 

Suppose that the observations w = (ur, are descnbed by the general 

model 

= + n=l, (81) 

whereihex, = (Xi„ x^^J^areknowmO = (Bj is the vector of unknown 

parameters, 0) belongs to a known parametric family of functions and 
i = (Ci normally distnbuted with £{y) = 0 and covariance V = 

cov(i, j), where 0 denotes the null matrix Then the probability density of y is 
described by 


X(u) = (2;r) '''^(det V) ‘/*exp(— ^u^V *u) 

Hence, the log likelihood function of t, given the observations w = (w^ Wf,y, 

IS 

— JlV ln(27r) — ln(del — g)’^V"Hw — g) 

where g = (g(xi,t) g(xjv, t)y Maximizmg this function with respect to t 
IS equivalent to minimizing (w — g)’^V'(w — g) Hence, if in the model 
(equation (8 1)) the errors are normally distributed with known covariance 
matrix, maximum likelihood estimation of the parameters is equivalent to 
weighted least squares estimation with the elements of the inverse covanance 
matrix as weights In practice least squares estimation techniques are also 
widely apphed to estimation problems where the above conditions are not met 
Therefore, Section 8 3 2 and 8 33 will be devoted exclusively to these important 
techniques 
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8.3.2 Linear Least Squares 

Suppose that in the model (equation (8.1)) the function g(x; 0) has the par- 
ticular form 

fif(x; p) = i?ihi(x) -f- + • • • + )5i/ii(x) 

where p = (jSi • • • is the vector of unknown parameters and the /i,(x) are 
known functions which do not contain elements of p. So the additively error 
corrupted observations vv„ are linear in the parameters and can conveniently 
be summarized in the form 

w = XP y (8.2) 

where the n, I element of the N x L matrix X is defined as hi(x„). First three 
examples of this important model will be given. 

Example 8.6 Linearity in the independent variables. Suppose 

= Pi^nl + /^2^n2 + • • • + pL^nL + A n = I, ..., N 

where . . . , are the parameters and . . . , are the independent variables. 
Then the observations are described by equation (8.2) with X defined as the 
N X L matrix with n, I element 

Example 8.7 Polynomial. Let 


^ = I, . . . , N 

Then the observations are described by equation (8.2) with the n, I element of the 
N X L matrix X defined as . Note that the observations are non-linear in the 
independent variables but linear in the parameters. 

Example 8.8 Discrete-time impulse response. Suppose w(n), n = 1, ..., N, 
are error-corrupted observations of the response of a linear discrete-time system 
having impulse response . . . , to an exactly known input ^(n) where n 
denotes discrete time. That is, 

Mn) = p,^(n) + p^^in - 1) 4- ••• -1- p^^n - L + 1) 4- v(n) n = 

where p{n) is a discrete-time stochastic process representing the errors. Then 
the observations may be summarized in the form of equation (8.2), where w = 
(w(l)w(2) • • - w{N)y, X is the N x L matrix having n, I element ^{n + I -\- 1) 
and V = (t;(l)t;(2) • - • v{N)y. 

Consider again the model 

w = XP 4- V 
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For the moment drop the assumption that j is normally distnbuted and that 
Its co^ ariance is know n- Then least squares estimation of p from w can generally 
be described as minimizing the least squares cnterion 

(w - xb)'^n(K - xb) 

with respect to b where is any positive definite symmetnc weighting matrix. 
The solution b for b which minimizes the abov e cnterion satisfies 

b = (x^ftx)"'x’^nw 

For a proof see Eykhoff (1974) This referents also discusses a number of 
useful properties of the estimator b which may be summarized as follows* 

(a) b is closed-form and unique 

(b) 6 IS linear m the observations u 

(c) bis an unbiased estimator for p 

(d) co\(b,b) = (X^nx) ‘x^nvftx(x'^ftx) ' 

(e) Ifn = V-‘thencov(b.b) = (X'‘V-'X) ‘ 

(f) If n = V *, b has smallest covanance among ajl unbiased estimators of 
p linear in w Therefore, in this particular case, b is usually referred to as 
best linear unbiased esfimofor of P Note that, loosely speaking, this par- 
ticular choice of weightmg matrix implies that m the least squares cntenon 
the differences between the model Xband the observations w are weighted 
according to the precision of the latter 

(g) If X ts normally distnbuted with covanan^ V, the covariance (X^V* ‘X)" * 
of the estimator b = (X^V" *X)” ‘X^V" is equal to the MVB for any 
number of observations 

If, in practice, the covanance of the errors is unknown, the identity matrix is 
often chosen as weighting matrix. This is usually referred to as uniform!) 
v.eighting Then the resulting least squares estimator is descnbed by 

6 = (X^X)-*X^w (83) 

From the above properties it is concluded that generally this estimator has 
no optimal properties It is, however, unbiased If the covanance of the errors 
IS known it is w orthw hile to w eight according to its inv erse, since this yields the 
most precise estimator among all unbiased estimators hnear m the observations 
Note, however, that there may be more precise non-linear estimators or biased 
ones Generally, the MVB is not achieved On the other hand, in the special 
case that the errors are normally distnbuted, the best linear unbiased estimator 
coincides with the maximum Iikehhood estimator and, m addition, achieves the 
MVB for any number of observations These considerations illustrate how a 
priori knowledge about the errors can be exploited to select an estimator or to 
improve the preasion 
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The numerical evaluation of the least squares estimator should be carried out 
with care, since the set of linear equations to be solved is often ill-conditioned, 
that is, the equations are nearly dependent. It may, therefore, be advisable to use 
algorithms that have especially been developed for linear least squares. For a 
discussion of these algorithms and corresponding ALGOL programs refer to 
Wilkinson and Reinsch (1971). Special programs for linear least squares are also 
included in scientific program libraries such as NAG (1978) and IMSL (1977). 

Finally, it must be emphasized that the theory and the practical aspects of 
linear least squares discussed here are only a fraction of the material available. 
For most extensive discussions the reader is referred to Draper and Smith 
(1966) and Seber (1977). Fedorov (1972) is specialized to optimal design of 
linear least squares experiments. 


Recursive linear least squares 

The least squares estimate (equation (8.3)) computed from N observations may 
be written in the form 

where 

Pat = (X^ Xy ) " ^ and = (wj • • • Wa,)'^ 
with 


Xat = (x , ; • • • i x^)^ and xj = (x„i ■ ■ ■ x„i). 


Now suppose that one additional observation j, X;v + 1 is made. Then it can 
be shown that bjv+i satisfies 

^A'+l = W + gAf+l(Wjv+l ~ xjJ+ibA') (8-4) 


where 


white 


gx+i — P/v^iv+i/(l "b xjv+iP/v^JV+i) 


(8.5) 


Piv+i — Pjv ~ Sa+i^n+iPn (8-6) 

For a proof see Goodwin and Payne (1977). Using these expressions the compu- 
tation of from b^, Xjv+i, and P,v carried out as follows. 

First the L x 1 vector g^^+i is computed from P^v and x^v+i. Note that in the 
expression for the quantity xJ+lP;vX^+l is scalar. The computation of 
|’x + 1 from bjv, j, vv^+ ^ and 1 is then straightforward. Finally, P;^+ 1 , which 

is required in the next recursion, is computed from x^+ 1 , gtj+ 1 , and Pj^. Initial 
conditions P^^^ and b^^^ can be non-recursively computed from the first Nq 
observations. With this particular choice of initial conditions the recursive 
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estimate for any N is equal to the estimate that would have been obtained 
non recursively from the same observations Furthermore, it is observed that 
in the expression for b;v + 1 the difference , — xjv + 1 b;y is the deviation of the 
observation Wjy+i from a predicted value based on the current estunate bjy 
and x^+ „ while g;v + 1 'S a varying vector of weights determining to what extent 
this deviation is taken into account to modify bjy 

Recursive least squares may be particularly useful when the number of 
available observations is relatively large The estimates may then be computed 
recursn ely for an increasing number of observations until they have satisfactorily 
converged Thus often only a fraction of total number of available observations 
need be taken into account to meet the objectives of the measurements Con 
sequently computation time is saved 

A further application of recursive least squares is parameter tracking that is, 
estimation of time varying parameters For that purpose a number of modified 
versions of the above algorithm have been developed For a comprehensive 
description of these algorithms see Goodwin and Payne (1977) A characteristic 
example of a tracking algorithm will now be discussed 

Example 8 9 Exponential forgetting In this example Jt will be supposed 
that the Nth observation is made at time N, that is, at an integral multiple 

of a fixed unit time interval Furthermore, the following least squares cntenoo 
will be chosen 

(w« - - X^b) 

where \ i) with 0 < ij ^ 1 Thus m this cnterion 

the weights of the quadratic deviations of the model from past observations are 
exponentially decreasing with time Then the least squares estimator of p is 
descnbed by 

6^ = PjX^£l;vWjv 

where P^y = (XjJl^yX^,)"* This estimate satisfies the following recursive 
equations 

bjv+i = bjv + Sff+i(wjv+i — Xjv+ib^) 

where 

gw+i = Pj»Xjy+|/(q + xJ+jP^x^+i) 

while 


Pw+i = (P« - giv+ixJ+iP«)/»7 

Note that substitution of i/ = 1 m these equations yields the original recursive 
least squares scheme Then all past observations fully contribute to 
the current estimate On the other hand, if ^ is small, only a relatively small 
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number of past observations effectively contribute to the estimate. So for small t] 
the estimate responds quickly to parameter changes, but is relatively imprecise. 
Consequently, in practice, the choice of rj depends on the objectives of the 
measurement and is always a compromise between response rate and precision. 

8.3.3 Non-linear Least Squares 

Suppose that the problem is the estimation of the parameters 0 from observations 
w = (wj • • • described by 

= 0 (x„; 0) + n= I,..., N (8.7) 

where the x„ are exactly known independent variables, the v„ are stochastic 
variables representing non-systematic errors and ^(x; 6) is a known function 
which, as opposed to the functions considered in Section 8.3.2, is non-linear in 
one or more elements of the vector of unknown parameters 0. For this model 
the weighted least squares estimate of 0 is defined as that particular value t of t 
that minimizes 

j(t) = (w - - g) 

whereg = • • • g(X}^; t)^ and is a symmetric, positive definite weighting 

matrix. 

With respect to this non-linear least squares estimator the following remarks 
can be made; 

(a) Ify = is normally distributed with expectation zero and known 

covariance V then the least squares estimate f which minimizes 

J(t) = (w - g)^V“ ‘(w - g) 

is the maximum likelihood estimate of 0 (Section 8.3.1). The estimator t is, 
therefore, generally consistent and converges in distribution to a normal 
distribution with expectation 0 and covariance where M is the 
information matrix. Moreover, it is easily shown that 

M = (dg/dtyy-\dg/dt) 

Again it is emphasized that consistency and convergence in distribution 
are asymptotic properties. 

Example 8.10 Measurement of radioactive decay and concentration. The 
mathematico-physical model of radioactive particle count vv„ at time x„ may be 
described by 

= I] exp(-pjx„) -h Un 

k 
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In these expressions 0 = A* fij where A^ is a measure of the 

concentration and /ij is the decay constant of the Ath component Furthermore, 
the counting result iv„, is supposed to be a Poisson distributed stochastic 
variable This implies that £(h J «= varfw,) Hence, if £(w„) > 

£6t;n) iv„ and var(w„) as u* Then, by the rentral limit theorem, is asymp- 
totically N(E(w„), Supposing that cov(\v„|,iv„j) = 0 for ni 

one obtains V as diag(iv,, ,w^) Hence minimizing the non-hnear least 
squares cnterion with the inverse of this V as weighting matrix asymptotically 
approximates the maximum Iikebhood estimator This again illustrates the use 
of a prion knowledge to enhance the precision of an estimator 

(b) In practice, if y is supposed to be normally distributed with unknown 
covariance, I is often taken as weighting matrix n The resulting estimator 
is not a maximum likelihood estimator unless V « aH 

(c) If V has known covariance V but is not normally distnbuted, non linear 
least squares with fl = V ' is often used The idea is again to give smallest 
weight to the largest deviations of the model from the observations The 
estimator so obtained is generally not a maximum likelihood estimator 

(d) If the distribution and covariance of y are unknown, it is common practice 
to use least squares with H » I Again this is generally not a maximum 
likelihood estimator 

(e) Suppose that m equation (8 7) the are nd with covariance V = all Then 
the estimator t of 0 which minimizes 

J(t) = (w - g)V - B) 

IS, under very general conditions, asymptotically normally distnbuted 
with expectation 0 and covariance U defined by 

u = (x2[(dg/50)T(dg/ae)]-* 

This, result vs due ta leantveh. {19^9] Mate that vC v vs. idd. and. normal., U is 
the MVB, while this is generally not true if y is non-normal This result is a 
useful tool to investigate the preasion of the uniformly weighted least 
squares estimator for idd errors and a large number of observations The 
properties of non-hnear least squares estimators for a relatively small 
number of observations may essentially deviate from those m the asymp- 
totic case For illustrative examples of this phenomenon see Van den Bos 
(1979) 

(f) It IS easily shown that 

grad J(t) = -2(dg/dt)^n(w - g) 

Since the elements of g are non Imear in t so are those of the gradient 
This implies that there is generally no closed form solution for the equations 
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grad J(i) = 0 which are necessary conditions for a minimum of J(t) at 
t = f. Hence these equations must be solved numerically. Therefore in 
Section 8.3.4 a short discussion will be given of a number of practical 
numerical minimization techniques. 


8.3.4 Numerical Minimization 

In what follows three different methods for numerical minimization are briefly 
discussed. For a more complete discussion of these methods and many others 
the reader is referred to Murray (1972), which also includes a chapter on 
numerical non-linear least squares. 

The first method to be described here is the steepest-descent method. This is a 
general minimization method. It has not especially been designed for non-linear 
least squares. The steepest descent method changes the current value of the 
parameter vector by an amount 

AtsD - -A grad[J(t,)]/l|grad J(t,)|l 

where, since the gradient is normalized, A is the step size. So Atso is opposite to 
the gradient. It can be shown that infinitesimally in this direction J(t) at t = 
decreases most. The steepest-descent procedure in its most primitive form may 
be summarized as follows; 

(a) Compute Atjo in t = t^. 

(b) Compute J(t<, -f Atso). If J{t^ 4- Atso) ^ 7(t^) select t^ 4- Atgo as new t,. 
and repeat from (a). If not, reduce A and repeat (b). 

Important properties of the steepest descent method are: 

(a) Under very general conditions convergence to a minimum is guaranteed. 

(b) AtsD is perpendicular to a contour of J(t). 

For non-linear least squares problems the latter property has some unfavourable 
consequences. In most non-linear least squares problems the minima are located 
in elongated, curved valleys, in the literature frequently referred to as banana- 
shaped valleys. Consequently, as a result of property (b), in the neighbourhood 
of a minimum the method proceeds along a zigzag path and converges extremely 
slowly. Far from the minimum, however, the method progresses rapidly. 

The slow convergence of the steepest descent method near the minimum can 
be avoided by using alternative methods of which the Gauss-Newton method, 
a special method for non-linear least squares, will now be discussed as an 
example. Recall that the problem is minimizing 


J(t) = (w - g)'^(w - g) 
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where w = (»Vi and g = {fj(X| , t) ^^(Xjv, In the Gauss-Newton 

method 5 (x,, t) is, m the neighborhood oft = tg, approximated by its first-order 
Taylor expansion Then 

gli=..+Ai « Bit u + 

where X = 5g/otatt = and At is a small increment oft This expression, which 
IS linear m At is substituted for g m the abo\e expression for J(t) The resultmg 
approximate cntenon, J(t), is subsequently minimized with respect to At This 
IS a standard Imear least squares problem which has the closed form solution 

-^^X)-'gradJ(t)l,=.^ 

where use has been made of the expression for the gradient described in Section 
8 3 3 For what follows it is important to note that the contours of 7(1^ + At) 
are ellipsoids In its most elementary form the Gauss-Newton iteration scheme 
may now be described as follows 

(a) Compute Atos from dg/ct and (w — g) at t = 

(b) Select te + Atov as new t« and repeat from (a) 

Important properties of the Gauss-Newton method are 

(a) As a rule the method diverges fat from the minimum 
0>) Far from the minimum the direction of the Gauss-Newton step is usually 
\ery close to the direction of the contour 

(c) Qose to the minimum the method comerges rapidly 

The last property is a consequence of the fact that as the minimum is approached 
the contours of the cntenon become ellipsoids and can, therefore, mcreasingly 
accurately be approximated by the quadratic cntenon J(t^ + At) Comparmg 
the properties of the steepest-descent method with those of the Gauss Newton 
method suggests combining both methods into one smgle method, which 
gradually changes from the steepest-descent direction to the Gauss Newton 
as TfiVmTauin is appioatVied and *«ViTCfe, m addAwti, Ws 

step size This is what is done by the Marquardt method described in Marquardt 
(1963) This method may be summanzed as follows In t = tj the Marquardt 
step At^Q IS defined by 

(X^X + AI)At^q = —j grad Jft^) 

where, as before, X = cgfdt at t = and A is a positive scalar to be specified 
later If for a particular } this set of equations is solved for Atyg the resulting 
solution can be shown to minimize J(te + At) on a sphere ||Atl)^ ilAtygll 
From the definition of the Marquardt step it follows that AtMQ approaches 
Atov as X approaches zero On the other hand if A is increased, Ats,Q approaches 
Atsu while the step size goes to zero Furthermore, it can be shown that both 
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IIAt^Qll and the angle between Also and At^q are decreasing functions of X. 
Hence At^g rotates towards At^o as X is increased. 

The Marquardt iteration scheme may be summarized as follows. Let v > 1 
and define and X^ as the current t and X respectively. Then an iteration con- 
sists of the following steps; 

(a) Compute AtMq for X = X^ ~ X^ and for A - /I 2 = XJv respectively. Next 
compute the corresponding values of J(te + AtMq). 

(b) If J(t^ + AtMq)l;ij ^ J(tc\ select X 2 and + AI^q as new X^ and t, and 
repeat from (a). Otherwise proceed with (c). 

(c) If J(t, + AtMq)Lj > J(Q and J(t, + AtMq)l;., ^ Htc), select Aj and the 
corresponding quantity t,, -f- At^jq as new A^ and and repeat from (a). 
Otherwise proceed with (d). 

(d) Take vA<. as new A,, and repeat from (a). 

Note that, within restrictions imposed by convergence requirements, A is kept 
as small as possible so as to retain as much as possible of the favourable con- 
vergence rate of the Gauss-Newton method. 

Important properties of the Marquardt method are: 

(a) Under general conditions convergence to minimum is guaranteed. 

(b) Rapid convergence in the neighbourhood of the minimum. 

Nowadays the Marquardt method, which works very well in practice, is 
included in a number of scientific program libraries (see e.g. NAG, 1978; 
IMSL, 1977). 


8.4 ESTIMATION OF PARAMETERS OF DYNAMIC DIFFERENCE 
OR DIFFERENTIAL EQUATION MODELS 

8.4.1 Introduction 

This section discusses estimation of parameters of models of dynamic systems 
or stochastic processes. The class of models is restricted to ordinary difference 
or differential equations since these frequently occur and naturally arise in 
practical problems. Within this class one is still free to make a choice between 
difference of differential equation descriptions and with respect to the order of the 
right-hand and left-hand members of the equation. Again the decision between 
the alternatives is closely connected to the purpose of the estimation procedure. 
For example, for automatic control purposes an accurate description of the re- 
sponse of the system to an arbitrary input may be sufficient. In this case the 
difference or differential equation model need not be an accurate description of 
the physical system involved. The same applies to difference equation models of 
stochastic processes to be discussed below. In this case the purpose is often a 
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compact quantitative description of the second-order properties of the stochastic 
process in the form of a small number of parameters On the other hand, if the 
purpose IS estimation of physical parameters, the differential equation model 
should accurately describe the relevant part of physical reality and follows 
directly from physical considerations, for example, conservation laws 


8.4.2 Discrete-time Difference Equation Models 

First a number of relevant properties of discrete-time stochastic processes are 
summarized Let x(nA) and ^(nA), /i = 0, ± 1, ±2, , be stationary, discrete- 

time stochastic processes where A is a constant time interval To simplify the 
notation m what follows the time scale will be chosen so that A = 1 Then the 
cross'coiariance function of x{n) and ^(n) is defined as 

C,/L) = £{[x(/i) - + k)- Py']) 

where = £[,x(m)] and Py =* £[i(n)], while xht autocovariance function oix(jn) 
is defined by 

Cxx(*) = £{D£(n) - + *) - /^x]} 

Furthermore, the cross-poucr density spectrum of ^(n) and ^(m) is defined 

as the infinite discrete Fourier transform of Cxy(.k) 

Sx/w)= f C,/*)exp(-jQjA) 


Note that S„(co) is periodic with 2n The corresponding inverse Fourier 
transform is defined as the sequence of Fourier coefficients Cj.y(k), 4 = 0, 
±1, ±2, , of S„(co) 


Cxyik) 


= rj: 


5,^(ct»)exp(jfccu) dm 


Sxxfcu) IS referred to as the power-density spectrum or simply power spectrum 
of2(n) 

An important relation between the power spectra of input processes and 
corresponding response processes of linear discrete-time systems concludes 
this summary Let the response of a linear discrete-time system to a unit impulse 
input applied at time fe = 0 be ,0,0,4(0),^(1),^(2), This sequence defines 
the discrete-time impulse response of the system Then the discrete-time frequency 
response function jffeu) of the system is defined as the infinite discrete Fourier 
transform of the impulse response, that is. 


= £ ^(fc)exp(-jaifc) 
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Next suppose that the stationary stochastic process u(n), n = 0, +1, ±2,..., 
is the input of a linear discrete-time system with frequency response function 
jf(a>). Furthermore, let y(n) be the response to m(h), n = 0, + 1, ±2, . . . . Then 
it is not difficult to sho\^ that the relation between the power spectrum 
of w(h) and the power spectrum 5„.(a)) of y(n) is described by 

5,/to) = lJf(co)i%^(co) 

If the autocovariance function of m(m) has the particular form 

CuuiJ^) — 

where the Kronecker delta S„ „ is given by 

fl if m = n 
" 1 0 otherwise 


u(n) is usually referred to as an uncorrelated or white process. Then the cor- 
responding power spectrum satisfies 

Hence, if u(«) is a white input process the power spectrum of the response is 
described by 

Syyico) = |,?f(cu)pcr^ 

So in this particular case the shape of Syy{co) is completely determined by the 
frequency response of the system. This result will be used extensively throughout 
this section. 

A model frequently used in practice to describe linear dynamic discrete-time 
transformations of an arbitrary stochastic process w(n) into a process y(ji) is 
the following: 

l(n) + aii(n - 1) -t- 1- cci^y(n - K) 

= PoUi") + huLn - 1) -f • • • + Pi^uin - L) 

In the literature this linear difference equation model is usually referred to as 
autoregressive-moving average (ARMA) model. If /?i = • • • = = 0 the model 

is called autoregressive (AR), all-pole or linear prediction model. If aj = • • • = 
% = 0 the model is referred to as moving average (MA) model. It is easily shown 
that the frequency response of an ARMA model is described by 


^(co) = exp(-jQ^) + ■■' + Pl exp(-jmL) 

1 + exp(— jcu) + b 0 £i; exp(— jcoX) 

Hence, if y(n) and u{n) are stationary 


S„(a)) = 


Po + Pi exp(-jQ)) -!-■■■ + Pl exp(-ja)L) 
1 -f exp(— jcu) -b ■ • • + aj^; exp(— jcuX) 
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Now suppose that u(n) is uncorrelated So Sjico) = Then the expression for 
S„(a)) shows that the power density spectrum of any discrete time, stationary 
stochastic process ^n) can be described as, or approximated by, the power- 
density spectrum of the response process of an ARMA model with a sufficiently 
large number of suitably chosen coefficients to an uncorrelated discrete-time 
stationary stochastic process u(n) Thus estimation of ARMA parameters from 
observations y(l), ,^N) is a parametric alternative to classical non-para- 

metnc Founer techniques as, for example, described by Jenkins and Watts 
(1968) The question now arises how the coefficients of the ARMA model can be 
estimated from observations of i(n) In what follows this will only be discussed 
for AR models smce these are much more frequently applied m practice than 
ARMA models One of the reasons is that for the AR parameters relatively 
simple closed form estimators are available while estimators for ARMA para- 
meters require an iterative solution 

An extensive survey on estimation of parameters of AR models and related 
subjects including many references is Makhoul (1975) Haykin (1979) discusses 
estimation of parameters of both AR and ARMA models The maximum likeli- 
hood estimator of the AR parameters discussed m Section 8 4 3 is due to Mann 
and Wald (1943) 


8 43 Estimation of the Parameters of the Autoregressne Model 

Suppose i’(n) is a discrete time stationary stochastic process For the moment also 
suppose that ).<«) is normally distribute Furthermore, assume that the follow- 
mg autoregressive model for ihei<n) is chosen 

- 1) + + - eCn) 

where the e (n) are idd and N(0, cl) Then the maximum likelihood estunate 
t = (fl, OjfS,)^ of the parameters 0 = (a^ o-rgI)^ from the N •¥ K 
observations — JC), j (2 — K), ^ may be computed as follows In the 

first place it can be shown that Oi, , satisfy 

ciQ,k} + aica,k)+ + *) = 0 fc = 1, ,K (8 8) 


and 


= c(0, 0) Oic(l, 0) + -I- oj^c(A’, 0) 


c{k„k,) = — 


kiMn - k^) 


where 



PARAMETER ESTIMATION 


355 


So the numerical procedure could consist of computing the c(/ci, /C 2 ) for k^, 
/cj = 0, . . . , K, solving equation (8.8) for Uj, . . . , 3^, and using this solution to 
compute Sg. However, it is observed that under very general conditions all 
c(fci, fej) with - /c 2 i = fc are consistent estimators of Cyy(k). They are, 
therefore, asymptotically equivalent and are in practice often replaced by c(k, 0). 
Then the above estimator may be approximated by 


/ c(0) c(l) 
/ c(l) c(0) 

W-1) • 


c(K - 1)' 



c(0) 

c(l) 


c(l) 

c(0) 



where c(k) = c(/c, 0). Note that all elements on a particular north-west, south- 
east diagonal of the coefficient matrix are the same. It will be seen later on that 
this special structure can be exploited to reduce the number of operations re- 
quired for the solution of the set. 

Finally, it is easily shown that the MVB for unbiased estimation of 6 = 
(ai • • ■ o-Kaiy is described by 


( ' C,,(0) Cyyd) ■ ■ Cyy{K-l) 0^^ 

CyyW Cyy(0) * j ^ 0 ^ 

: . . CyyiO) Cyy(l) 0 

Cyy(K - 1) • • Cyy(l) Cyy(0) 0 

, 0 0-0 02/ 

For large N the covariance of t can be estimated by replacing the Cyy{k) and 
and (Te in this expression by their estimates. 

The above results apply to normally distributed e{n). In some respects, 
however, they are more generally applicable. In the first place for idd, but not 
necessarily normal, e(n) the following relations are valid: 

Cyy{k) + (XiCyyQc — l) "f f “ K) = 0 wfiero /c ^ 1 

In the literature these relations are usually called Yule-Walker equations. 
Now suppose that a is estimated by replacing the CyyQc) in the first K Yule- 
Walker equations by their estimates c(k) and solving these equations. Then it 
can be shown that the resulting estimator, the expression for which coincides 
with that for the approximate maximum likelihood estimator in the normal 
case, is asymptotically normal with an asymptotic covariance described by the 
same expression as the MVB in the normal case. Note, however, that in the 
non-normal case this asymptotic covariance matrix is generally not the MVB. 
For a discussion of a general class of estimators, called prediction error esti- 
mators, having these properties the reader is referred to Chapter 5 of Goodwin 
and Payne (1977). 
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Durbin's method for the computation of autoregressive coefficients 

The first K estimated Yule-Walker equations and the equation for the estimated 
vanance of the e(ri) for an autoregressive model of order K are respectively 

c(k) + a<f^c(k - 1) + +^cik~K) = 0 k=l, ,K 

and 

= c( 0 ) + a',«c(i) + + aPc{K) 

Using a method due to Durbin (1960) one can exploit the special structure 
of these equations to reduce the number of numerical operations In Durbin’s 
methodtfj*^, ,S5^\Se'^andc(K + l)areusedtocomputefl\**‘^ 
and So the current coefficient and vanance estimates are used to com 

pute the corresponding quantities for the next higher order model This is 
continued until in the sense of some stopping criterion no further improvement 
IS achieved by increasing the model order The equations used in Durbin’s 
method are as follows 

alV." = -WK + 1) + a'i'’c(K) + + 

Sf* ‘1 = at'i + al'.V’aK 1 -» * = 1. 

where ^ = 0, 1, , while the initial conditions are descnbed by «= 0 and 

= c(0) From these equations u follows that the number of multiplica- 
tions in one step of Durbin’s method is roughly 2K Hence, computation of the 
coefficient estimates for all models of order up to and mcluding K requires 
approximately multiplications This is substantially less than with con 
ventional methods For example. Gauss elimination requires + 0{K^) 
multiplications for a model of order K alone Choleski decomposition, special 
ized to solution of symmetnc, positive definite sets of equations uses + 
0{K^) multiphcations for the same purpose So with these methods computa- 
tion of the coefficients of all models of order up to and including K requires a 
number of multiplications of order K* This is, of course, only a fair comparison 
if the estimates of the coefficients of all lower-order models are actually needed 
Furthermore, whether or not a substantial reduction in the total computation 
tune is achieved by Durbin’s method, also depends on the number of observa 
tions taken into consideration smce this determines the number of multiplica- 
tions to be earned out for the computation of the required c{k) For a large 
number of observations the latter number of multiplications may considerably 
exceed the number of multiplications required for solution of the Durbin 
equations On the other hand, if the observations have been quantized by an 
analog-to-digital converter, as is often the case in practice, the cif) may be 
computed m fixed point Fixed point operations are usually much faster than 
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floating point operations, as are required for the solution of the Durbin equa- 
tions. In some applications, for example, in geophysics, estimation problems 
are found giving rise to sets of equations similar to the Yule- Walker equations 
but with an arbitrary right-hand member. That is, the right-hand member does 
not necessarily consist of — c(l), . . . , —c(K). Then use can still be made of the 
particular structure of the left-hand member by using a method due to Levinson. 
This method requires twice as many multiplications as Durbin’s method and is 
described by Robinson (1967). 

Recursive estimation of the autoregressive coefficients 

A recursive form for the estimator 

5ic(l, k) + ■■■ + a,^c(K, k) = — c(0, k) k = 

discussed earlier in this section, can be obtained as follows. Let be the vector 

(«]••■ (IkV computed from y(l - K), y(2 - K),..., y(0), y(l) yiN). 

Furthermore, define 

' y(0) y(-l) ... y(~K + l)\ /xj\ 

y(l) y( 0 ) ... y(-/C + 2 )\ /xj \ 

y(N~l) y(N-2) ... yiN-K)/ \xj/ 

— O^n^n) ^ ^nd = yiN) 

Then a;v niay equivalently be written 

Hence, using the recursive least squares formulae of Section 8.3.2 one obtains 

^iV+i = 3^’ + giv+iC'Vjv+i ~ 

where 4 . i) and xj+i = — 0'(A/) • • - y(N — K + 1)). The vector 

g.v+i in this expression is computed from Pjv and x^+i while P^. is recursively 
computed from P^.+ j and x,v, as described in Section 8 . 3 . 2 . 

8.4.4 Estimation of Parameters of Dynamic Systems from Input-Output 
Observations 

The difference equation models discussed up to now were models of stochastic 
processes. These stochastic processes were considered to be exactly measured 
responses of a system described by the difference equation to an unmeasured 
white input process. The problem to be discussed now is estimation of the para- 
meters of a linear difference equation model from observations of both the input 
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£(ri) 

tS—, 

y(nj win) 


Figure 8 1 System model having stochastic 
input u(n) to the transfer function and non 
systematic errors input £(n) 

and the corresponding response In addition, it will be supposed that the obser- 
vations of the input are exact while the observations at the output consist of the 
sum of the true response and a stochastic process representing non-systematic 
errors In measurement practice this model is frequently found to adequately 
descnbe observations of mput and response of dynamic systems The case that 
both the observations of the input and those of the response contain additive 
errors will be discussed later on 

The model to be studied first is shown in Figure 8 I In this figure u(n) is the 
input which, for the moment, will be assumed to be a stationary, zero mean, 
stochastic process The response to u(n) is ><n) which is also assumed to be a 
stationary stochastic process This implies that the system is stable and that any 
transient responses to «(n) have disappeared The process £(n) represents the 
non-systematicerrors in the response observations K(n) Finally, 
represents the system’s discrele-ttme transfer function to be specified below 
Now suppose that the system is described by the hnear difference equation 

^<n) + cti^n - 1) + + otKlin - K) 

= i5oM(n) + piu{n - 1) -f + pMliin - M) 
with K > M Then the problem studied here will be estimating 
0 = (ai ttx Po 

from input-output observations k(1 - K), u(I — K), h( 2 - K), w(2 - M), , 

For convenience first the above difference equation is rewritten m the form 

where 

•n^(r*‘) = 1 + ffiZ”* + + a^z”* 

+ Pt^~ ■ + + Pu2~'‘ 

while z IS the forward shift operator defined by z><n) = >(n + 1) or z~ *y(n + 1) 
= yfn) Then it follows from Figure 8 I that ~ 

■>3^(2 "'')«(«) = ^(z“ *)«(«) + q(n) 


1) 
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where 

q(n) = s/(z-^)p(n) (8.9) 

This model suggests selecting as an estimator for 0 that value t of 

t = (uj • • • % ^0 ■ ■ ■ 

that minimizes the least squares criterion 

m = £ [ACz-^)w(n) - B(z-^)u(n)r 

n — X 

where A{z~^) = 1 + + ■ ■ • + and B{z~^) = ho + + • • • + 

bf,iZ~^. The motivation for this choice is, of course, that t is a simple, closed 
form, linear least squares estimator. Unfortunately, it can be shown that for 
h/ ^ 00 the difference t — 0 only converges to zero if the covariance function 
Cpp{k) of p(n) satisfies 

,5/(z-i)Cpp(/c) = 0 /c=l,...,K 

For a proof for first-order J^iz~^) see Goodwin and Payne (1977). This proof 
can easily be generalized to the above result. Recalling that s/(z~^)p{n) = q(n), 
one observes that the above conditions for convergence are equivalent to the 
Yule-Walker equations if q(n) is white. Conversely, it can also be shown that 
under the above conditions q(n) is necessarily white. Hence a Tiecessary and 
sufficient condition for the above least squares estimator to converge is that q(n) 
is white. Note that this is not equivalent to the statement that t converges to 0 
if the errors in the observations of the response are white. It is clear that in most 
cases q(n) will not be white and t will consequently be a biased estimator of 0. 
Therefore the use of? is generally not advisable, the more so as asymptotically 
unbiased alternatives for t are available which will now be discussed. 


The generalized least squares method 

Suppose that in the above model of the observations 

^{z~^)w(ii) = ^{z~^)u(n) + q(in.) 
the process q(n) is modelled as an autoregressive process 

q(n) = ^~^(z~^)e(n) 

where ^(z"‘) = 1 q- yiz~^ -b • • • -b yi,z~^ and e(n) is white. Then 
j^(z“^)w'(«) — ^(z~^)u'(n) — e(fi) 


( 8 . 10 ) 
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where u'(n) = 2?(2" ‘)H(n) and u'(n) == Then, since £(n) is white, the 

value t of t that minimizes 

■'(•)= ')«'(>■)- 

IS an asymptotically unbiased estimate of 0 Unfortunately, is usually 

unknown To see this, suppose that the errors p(n) satisfy p(n) = ^ '(z“*)e(n) 
Theng(n) = ^ ‘(z ^)s/(z *)e(n) = ^“‘(z“‘)e(n) Hence, 

<?(z‘*) = S(2-‘)^-‘(z *) 

In this expression neither Qiz ~ '), which represents the dynamic properties of 
the errors, nor ^(z“ *), which must be estimated, is known The solution chosen 
m the generalized least squares method, due to Clarke (1967), is to estimate the 
parameters y = (> i of ^(z" ') along with 0 For this estimation problem 

the generalized least squares method uses an iterative scheme which may be 
described as follows 

Suppose that the estimate of y= (j-i y,)^ obtained in the /th iteration is 
|i/] the polynomial ^(z" *) having these coefficients is 

Thenthe(/ + l)th iterationconsistsofthefollowingsteps 

(a) Computer (fi) = 6*^(z"‘)u(n)and u(n) = ‘)u(n)forn = 1, ,N 

(b) Minimize 

I [/l(z-').v'(n)-5(z ')u(n)]" 

n K*l 

With respect to t Let the result be i = (d, bo 

(c) Compute ^(n) * ^(z ‘)H<n) - 5(z"‘)u(ii)forn = 1, ,iV, where -3(2’*) 

and B{z have been obtained by substitulmg t in A(z~ *) and B{2~ ') 

(d) Mmimize 

X rG(z ^)q{n)y 

n=L*l 

With respect to g = (pj gi}^ Take the result as 

(e) Stop if convergence has been achieved If not, repeat from (a) As initial 

condition = 1 may be used 

With respect to the generalized least squares method the following remarks can 
be made In the first place this method is known to work satisfactorily in 
practice Furthermore, as is shown by equations {8 9) and (8 10), the parameters 
characterizmg the dynamic properties of the errors are estimated along with the 
parameters of the system This may, for example, be of importance for automatic 
control applications Finally, it is observed for normally distributed g(n) the 
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solution is the maximum likelihood estimate. For other approaches to maximum 
likelihood estimation of the parameters of linear discrete-time dynamic systems, 
including the multi-input multi-output case, the reader is referred to Goodwin 
and Payne (1977). This reference also describes a recursive version of the 
generalized least squares method. 

A further simple, straightforward method for estimation of parameters of 
linear discrete-time dynamic systems is the following. 


The instrumental variable method 
Suppose that the system is described by 

y(n) q- aiy(n - 1) -t- • • • -1- a^y(n - K) 

= PoU(n) + Piu(n - 1) + ... + p^u(n - M) 

and that the problem is to estimate 0 = (aj • • • ' PmY from observations 

t)(l — K), w(l — K ), . . . , u(l), w(l), . . . , v(N), w(N) where 

w(n) = y(n) -f p(n) and v(n) = u(n) -f q(n) 

where g(«) and p(n) are stochastic processes representing possibly correlated 
non-systematic errors in the observations of the input and the corresponding 
response respectively. Furthermore, suppose that in addition to these observa- 
tions K + M + 1 further sequences of observations are available described by 

/2(l),...,/2(iV) 


/jC + M+l(l)> • • • 5 /jC + M+l(i'^) 

where the/,(n) are stochastic processes con variant with y(n) and u(n) but not 
with p(n) and g(n). The /,(«), n — will from now on be referred to as 

instrumental sequences. Multiplying the left-hand and the right-hand members 
of the system difference equation by the instrumental sequences and taking 
expectations yields 

+ • • • + 

= ^oCuf.m + --- + PMCuf.m 1== + M + I 

where the Cyj-^(k) and the Cuf^im) are the cross-covariance functions of y(ri) 
and f,{n) and of u(n) and /,(n) respectively. So, if these exact cross-covariance 
functions were available the unknown system parameters could be computed 
by solving the set of linear equations 

C,,^,(0) + + • ■ • + 

= boC,^,(0) + ... + l=i,...,K + M-bl (8.11) 
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for (Ji, , Ofc, bo, , b^f However, the exact cross-covanance functions are 
not known Since p(n) and £(n) are not covaiiant with the/,(n), estimates of the 
system parameters can then be computed by replacing the covariances in 
equation (8 11) by the corresponding mean lagged product estimates 

^ E - WiW 

and 

1 ^ 

C.J.W = jj'L m)/,(n) 

and subsequently solving the resulting set of linear equations 

= Bot,f(Q)+ 1=1, .K + M + 1 

fort = (flj UjcSo TheestimaioriisusuallyreferredtoasmstrMmenMl 

variable estimator As is shown in Jenkins and Watts (1968) mean lagged product 
estimators are under very general conditions consistent Furthermore, a 
continuous function of a consistent estimator of a parameter is a consistent 
estimator of the function of that parameter (see Wilks, 1962) It is concluded that 
the instrumental variable estimator is consistent under very general conditions 
as well 

The above considerations show that computationally the instrumental 
variable estimator is very attractive It is closed form, no iterative procedures 
are required and consequently convergence problems are avoided On the other 
hand, generally the instrumental variable estimator has no optimal statistical 
properties, and does not estimate the parameters of the errors Furthermore, the 
applicability of the estimator crucially depends on the availability of the required 
instrumental sequences However, if one instrumental sequence has been found, 
the remaining instrumental sequences required may be chosen as time shifted 
versions of the first one For example, if /,(n) is available the remaining instru- 
menfai’ sequences may 6e taken as = /j(h — rf + k = 2, 5“, TTrrfa' 
illustrated in Examples 811 and $ 12 

Example 8 11 Open loop estimationusing instrumental variables Intheesti' 
mation problem of Figure 8 1, time shifted versions of the input «(«) may be taken 
as instrumental sequences if u(n) and p(n) are not covariant Also time-shifted 
linearly filtered versions of the input are chosen On the basis of the available 
knowledge about the dynamic properties of the system under investigation the 
filter is chosen so as to approximate these properties The purpose is to ensure a 
relatively strong covariance between the observed response and the instrumental 
sequences A recursive instrumental variable algorithm implementing this idea 
IS described in Chapter 7 of Goodwin and Payne (1977) 
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dn) 






v(n) 




r(n) 


w(n) 




v(n) - u(n) + p(n) w ( n ) =y(n) + q ( n) 

Figure 8,2 Model used for estimation of a closed-loop control system 


Example 8.12 Closed loop estimation using instrumental variables. Suppose 
that the problem is the estimation of the parameters of the transfer function 
JfsCz" in Figure 8.2. In this figure is the transfer function of a con- 

troller, x(n) is an input process at the set point, while r(n) is an additive stochastic 
process at the system output equivalent to all disturbances introduced anywhere 
in the loop. Furthermore, u(n) is the component of v{n) generated by x(n) while 
^(n) is the response of the system to u(n). The processes p(n) and q(n) result from 
r{n). It is assumed that x(n) and r(n) are independent. Now suppose that observa- 
tions of v{n), w{n) and x(n) are available. Then shifted versions of the observa- 
tions of i;(«) may not be taken as instrumental sequences since v(n) is covariant 
with r(n), which is the part of \v(n) not causally related to v(n) and which, 
therefore, represents the errors in the measured response. Shifted versions of 
x(n), however, are feasible instrumental sequences since these are independent 
from both p{n) and g{n). 


8.4.5 Test Signals 

In physical practice, the input u{n) is either a sampled version of a continuous 
time normal operating process or it is a sampled version of a continuous time 
test signal, that is, an externally generated process that is deliberately introduced 
into the system for estimation purposes. Test signals have a number of ad- 
vantages over normal operating inputs. In the first place test signals need not be 
measured, they are accurately known and repeatable. Furthermore, it is 
usually fully justified to assume that they are independent of all other processes 
present in the system. Moreover, within physical limits their power spectrum 
and shape as a function of time can be freely selected. The synthesis of test 
signals will now briefly be discussed. This discussion will be restricted to periodic 
test signals since the most important practical test signals used in system 
parameter estimation belong to this class. 
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SyTithesis of practical periodic test signals 

A penodic test signal having any specified power spectrum can easily be synthe- 
sized by adding harmonics of suitably chosen amplitude However, the resulting 
harmonic sum signals are, due to their complicated shape, often very difficult 
to introduce into practical systems without nonhnear distortion Moreover, 
they often have a \ery unfavourable ratio ofr m s to peak value Both difficulties 
can often be avoided by using binary test signals In the literature these are also 
referred to as tHO-leiel signals Binary signals switch between two different, 
fixed amphtude levels only In addition, to facihtate generation using a general- 
purpose or special-purpose digital computer, practical test signals are preferably 
discrete interval signals, that is, they only change amplitude at mtegral multiples 
of a fixed time interval Periodic, discrete-interval, binary signals combine the 
above desirable properties and will, therefore, now briefly be discussed 
The best known example of a periodic, discrete-interval, binary test signal 
IS the maximum length binary sequence These signals arc extremely simple to 
generate and have, in addition, spectral properties which make them attractive 
for a variety of applications The generation of maximum length binary sequences 
may be described as follows The modulo-2 sum of the content of the last stage 
and that of one or more of the other stages of a binary shift register are fed back 
to the input of the first stage For certain suitably chosen feedback combinations 
the sequence formed by the successive content of a particular stage repeats 
Itself after » 2' — 1 shift pulses where K is the number of stages It can be 
shown that this is the maximum achievable sequence length with modulo-2 sum 
feedback, hence the name The periodic discrete-interval version of the maxi- 
mum length binary sequence is obtained by applying the shift pulses eqm- 
distantly and taking the content of a particular stage, which is constant durmg 
the shift pulse interval, as output of the generator Appropnate feedback 
combinations are tabulated in Peterson (1961) For a further discussion see 
Hoffmann de Visme (1971) and Davies (1970) The spectral properties of maxi- 
mum length binary sequences may be summarized as follows Define the com- 
plex Fourier coefficient of the fcth harmonic of a signal u(t) periodic with Tas 

1 r*- 

Then it can be shown that for any maximum length bmary sequence having 
amplitude levels + 1 and — 1 
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(see Hoffmann de Visme, 1971). So for any maximum length binary sequence 
the envelope of the power spectrum is similar. It can also be shown that the 
relatively flat part of this spectrum around the origin contains the major part 
of the total power. The latter property makes maximum length binary sequences 
very suitable to, more or less uniformly, cover the bandwidth of the system 
under test. 

As will be discussed later in this section, the power spectrum of the test 
signal influences the precision of an estimator of the parameters of a dynamic 
system. So to improve the precision, the use of a discrete-interval binary test 
signal having a power spectrum specified by the user and generally different 
from that of a maximum length binary sequence may be preferred. The same 
applies if certain estimation methods, specialized to periodic test signals and 
discussed below, are used. It may then be advantageous to employ periodic test 
signals that have the major part of their power concentrated in a specified way 
in a small number of relatively widely spaced harmonics. Periodic binary signals 
having this property are usually referred to as binary multifrequency signals. A 
simple method for synthesis of periodic, discrete-interval, binary signals, 
multifrequency signals in particular, having approximately a given power 
spectrum is described in Van den Bos and Krol (1979). This method minimizes 
in the least squares sense the difference between the Fourier amplitude spectrum 
of the signal and a specified amplitude spectrum. Finally, it is worthwhile to 
mention that the computation of the Fourier coefficients of a periodic discrete- 
interval signal from the Fourier coefficients of discrete time, that is, sampled 
versions and the converse is particularly simple and is described in Van den Bos 
and Krol (1979). 


Estimation using periodic inputs 

The discrete-time estimation methods to be briefly discussed now are only 
applicable if the test signal is periodic. The complex Fourier coefficients yi^ 
of a discrete-time signal x(n) periodic with J are defined by 

1 

7ix = 7 Z ^(«)exp( -}2ninJI) i = 0, ...,/- 1 

For an extensive discussion of the theory of discrete Fourier analysis, which in 
many respects is analogous to its continuous time counterpart, the reader is 
referred to Oppenheim and Schafer (1975). Chapter 5 provides a review. 

Now consider again the estimation problem of Example 8.12 and suppose that 
x(n) is periodic with I. Then u(n) and y(n) are also periodic with /. In addition, 
assume, as is suggested in Example 8.12 that the required instrumental sequences 
are chosen as x(ti — I + 1), I = 1, K + M + 1. Then using elementary 
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properties of discrete Fourier coefficients one can easily show that the equations 
(8 1 1) are equivalent to 

= 0 (8 12 ) 

where s, — e%pi22ni/N) and / = 1, ,K + M + I These are linear equations 
m the unknown parameters They could be solved if the exact F ouner coefficients 
and y,y, i = 0, , / — 1, were available, which is not the case However, by 

replacing the y,„ and y,y in equation (8 12) by corresponding estimates and 
subsequently solving, estimates of the system parameters are obtained Con- 
venient estimators of y,, and are 

j N-l 

1‘> = TS T. »(n)<:xp(-j2iira//) 

^ It o 

and defined correspondingly In this expression N is supposed to be an 
integral multiple of the period I Tben under very general conditions with respect 
to p(n) and gfn) and 2.(i are consistent (See Levin 1959) Since the proposed 
estimator of the system coefficients is a continuous function of the^m and 2,1 
It is consistent if the and 2, are 

A related method is the following Consider the least squares criterion 

-'{0= l' |/f(s, -B(s, ■■))’, .1" 

1-0 

where t = (ai b(, bst)^ Then equating the gradient of J(0 to zero and 
subsequently using some elementary properties of discrete Fourier coefficients 
yields the following necessary conditions for a minimum 

'z [^(5,W - = 0 f;=I, ,K (813) 

■ »0 

and 

/ I 

Z ~ = 0 m = o, i, ,m (su) 

1 = 0 

Since the exact Fourier coefficients y,„ and y,, satisfy 

■s^(sr')v„ - ®(sr')y.- = 0 

It follows that at t = 07(t) is zero Moreover, since J{t) is quadratic m t, t = 6 
IS a unique minimum Now, if, as above, the y,„ and y,, are replaced by cor 
responding consistent estimators, equations (8 13) and (8 14) become compu 
tationally simple, closed form, consistent estimators of the system parameters 
Moreover, comparing equation (8 12) with equations (8 13) and (8 14) one 
observes that the latter estimator employs shifted versions of the estimated 
periodic component of input and response as instrumental sequences m equa 
tions (8 13) and (8 14) respectively 
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Both methods described have the advantage that the Bode diagram can 
directly be estimated from the estimated Fourier coefficients. This is useful as a 
check of the feasibility of the observations. Also, if observations are missing 
or erroneous, the total loss of observations may be restricted to the loss of one 
period only. Furthermore, for suitably chosen I the computation of and %y 
from v{n) and w(n) can very efficiently be carried out using a fast Fourier trans- 
form (FFT) algorithm. FFT algorithms are described in Oppenheim and 
Schafer (1975). In this volume the FFT is discussed in Chapter 5. On the other 
hand, if periodic test signals are used having all or the major part of their power 
concentrated in a small number of harmonics, both the amount of data and the 
computation time may considerably be reduced if only the Fourier coefficients 
corresponding to these dominant harmonics are computed and included in the 
estimators, equations (8.12) or (8.13) and (8.14). Finally, as opposed to virtually 
all other discrete-time estimation methods, these periodic test signal methods 
have continuous time counterparts, that is, methods for estimation of parameters 
of differential equations. An example of such a continuous time method will 
briefly be discussed in Section 8.4.7. 


Precision of dynamic system parameter estimation 

This section discusses the MVB for estimation of parameters of linear discrete- 
time systems. In particular the dependence of the MVB on the input will be 
studied. This study will be restricted to the estimation problem of Figure 8.3. 
In this figure the stationary stochastic process y{n) is the response of the system 
to the stationary input process u(n). Furthermore, it is supposed that the re- 
sponse observations w(n) are the sum of y(n) and a stationary stochastic process 
v(n) which represents non-systematic errors. The process v(n) is modelled as the 
stationary response of a system having a linear transfer function to an iid, 
N(0, crj process e(«). Now assume that in Figure 8.3 

1) = ^(z- ^)ls^{z - ') and ^^{z - = ^(z" ^/©(z- 

where 

j/(z“^) = 1 -f ajz"^ -f • • • -f 
^(z = fo + fiz ‘ -!-•••-)- ^ji/Z 

S){z~^) = 1 -t- 5iZ“‘ + ••■ + 

^(z"^) = 1 + yiz“i + ■■• + 

Now suppose that 6s = («! • • • % i^o • ■ • PmV is estimated from observations 
y(l)) w(l), • . . , w(N) and that, in addition, the error parameters Gp = 
(^1 • • • djr. yi • • • (jeY are unknown. Then the subject of this section is the 
MVB for unbiased estimation of Gg. 
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Figure 8 3 Model used for study of dependence 
ofMVB on the input 


The information matrix I for the problem considered is defined by 
I = -E(d^ In L/dO^) 

where 0 = (Oj and L is the likelihood function of the observations 
>iiW L can be computed from the probability density function of 
£(1) ,£(//) defined by 

f,(t) = (2n) 

and the relation 


£(«) = JTd ‘(2 ‘)[!^n) - ^s(2"0y(n)] 

which follows from Figure 8 3 Then as is shown in Goodwin and Payne (1977) 
the information matrix computed from this likelihood function assumes the 
following form 



where 


Is = -E(d^ In L/d^) and Ip = -E(d^ In L/dO^) 

Note that Is is the information matrix for estimation of 9s if 0 d is known Then 
the MVB for unbiased estimation of 0 becomes 



Hence, for the model of system and errors concerned, the MVB for the system 
parameters 0s does not depend on whether or not the error parameters Op 
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are known. Therefore, in what follows only Ig will be studied. Goodwin and 
Payne (1977) show that Ig may be put in the form 

Is = N f J(ft))S„„(a;) doi (8.15) 

J — tr 

where the elements of the matrix J(co) are functions of the frequency and are 
parametric in 0s and Bp while S„u(co) is the power density spectrum of the input 
ufn). Then the feasibility of an experiment for the pertinent parameter estimation 
objectives can be investigated by computing the MVB from equation (8.15) 
for given 5u„((u) and nominal values of 0s and B^,. Also, if w(n) is a test signal, 
the achievable precision with different test signals under the same circumstances 
may be computed and subsequently compared. The question may then arise how 
to design an optimal test signal, that is, a test signal the power spectrum S„u((w) 
of which optimizes a chosen criterion defined on the elements of the MVB. 
This problem is extensively discussed in Goodwin and Payne (1977). Here only 
some important aspects of optimal test signal design will be reviewed. 


Design of optimal test signals 

Equation (8.15) shows that the elements of the MVB can be made arbitrarily 
small by increasing S„u(a)). In practice, however, the amplitude or the power 
of the test signal is always restricted. Here it will be assumed that the input 
power al satisfies the constraint 

ffu = ^ J 5„„((o) dco = 1 

Now consider the class of L x L information matrices described by the general 
form of equation (8.15). Furthermore, define two S„„(w) satisfying the above 
constraint as equivalent if the corresponding information matrices are equal. 
Then it can be shown that for any power spectrum there exists an equiva- 
lent line power spectrum S'u„(aj) consisting of at most ^L{L -1- 1) + 1 lines. Then 
S'uuicti) is of the form 

L' 

SL(co) = Z ^ 0 

1= ~L' 

where Sf 5(0) — oi.) is a Dirac delta function of area s,- at oj = o),-,L' = i-L(L -f 1) 
+ landa)_i = —co,-. So for any input one can find an equivalent input consisting 
of a finite number of sinusoids. It is emphasized that the above-mentioned 
number of lines is the maximum required number. It can often be substantially 
reduced by making use of special properties of the information matrix for the 
estimation problem at hand (see Chapter 6 of Goodwin and Payne, 1977). The 
importance of the above considerations for optimal test signal design is that 
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one need only optimize the chosen cntenon with respect to a relatively small 
number of variables co, and s, Frequently used criteria are the trace or the de- 
terminant of the MVB 

Often, in practice, a priori knowledge about the system and errors is available, 
being obtained from pre\ious experiments or mathematico-physical analysis 
Optimal test signal design techniques provide a means to exploit this knowledge 
to improve the precision Further applications of these techniques may be 
establishing theoretically justifiable rules of thumb for test signal selection and 
comparing the performance of test signals frequently used m practice to what 
optimally can be achieved under the same circumstances 


8.4.6 Connections Between Differential and Difference Equation Models 


This subsection discusses a number of aspects of estimating the parameters 
0 = (ao “jc - 1 ^0 PkV of ihc differential equation model 


IT 


+ 




d'-'jd) 


+ 


+ + 


+ Ml) (816) 


from obscr\ations of the input M(r) and the response >(t) This type of model 
directly results from continuous-time physical laws, for example, conservation 
laws Consequently, the coefficients usually have a clear physical meaning In 
applied and experimental physics it is often these coefficients or functions of them 
that are to be measured, as opposed to in automatic control applications where 
any model, perhaps discrete time or of an order lower than that of the physical 
model, is feasible as long as it accurately describes the response to any input that 
can be expected 

Analysis of the discrete-time estimation methods described earlier in this 
chapter reveals that the continuous lime analogs of these would all require 
numerical differentiation of the observations with the exception of the frequency 
domain methods discussed m Section 84 5 The continuous-time analog of one 
of the latter methods is described in Section 8 4 7 If the observations are in 
analog form, numerical differentiation without bandwidth limitation usually 
gi\ es rise to unacceptable signal-to-noise ratios On the other hand, bandwidth- 
limited differentiation is an approximative technique, the effect of which on the 
estimation results is difficult to trace Also, differentiation schemes for discrete- 
time observations are essentially approximative A further possibility, the use of 
hybrid computer methods for estimation of continuous-time system parameters, 
will not be discussed here The interested reader is referred to a survey by 
Piceni and Eykhoff (1975) As compared with the number of applications of 
estimation methods using digital computers, nowadays the number of pertinent 
hybrid computer applications is relatively small 

From the above considerations it may be concluded that using discrete-tune 
methods for estimation of parameters of continuous-time systems would be 
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attractive. However, this requires a one-to-one relation between the discrete- 
time and continuous-time parameters. For discrete-interval inputs such a 
relation exists and will now briefly be discussed. 

First the differential equation (8.16) is rewritten in a state-space form. The 
particular state-space representation chosen is 

x(f) = Ax(i) + bH(f) 

(8.17) 

y(t) = cx(f) -1- dii(t) 

where 


and 



0 

0 

1 


0 

0 

0 


0 

0 

0 

1 




■“ <^oPk 
- ^k-iPk 


c = (0 ••• 0 1) 




x(t) = (Xi(f) • • • XK(f))'^ 


For a general discussion of the state-space description of linear systems see 
Ksvakcrnaak and Sivan (1972). In this reference it is also shown that the solution 
of the above state-space equations is described by 

x(0 = exp[A(f - fo)]x(fo) + f exp[A(f - fo)]bn(t) df (8.18) 

•'•o 

where x(to) is the vector of initial conditions and exp X is defined as the Taylor 
expansion 


expX = I -t- X/1! + Xyil 


Now suppose that u{t) is discrete-interval with interval A, that is h(i) is constant 
for ijA < t < (n -t- 1)A for all ;j, and define u'(n) as the value of u(f) on this 
interval. Furthermore, define x'(n) as x(/iA). It then follows from equation (8. 1 8) 
that 


x'(« -f 1) = A'x'(n) + b'u'(fi) 
rXn) = c ' x '( h ) + dV(n) 


(8.19) 


where 


A' = exp(AA) b' = A '[A' — FJb 
c' = c cV — d 


( 8 . 20 ) 
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For a proof sec Kwakemaak and Sivan (1972) The discrete-time system 
(equation (18 19)) and the continuous-time system (equation (8 17)) are called 
eqiinaleni systems since at i = nA Ihcir respective responses coincide 
The relations (equation (8 20)) show how the equivalent discrete-time state- 
space description can be computed from the continuous-time description 
The solution of the converse problem will now be considered For a more 
extensive discussion see Harris (1979) Suppose that the system under test is 
described bj equation (8 16) and that the input «(0 is discrete-interval Further- 
more, suppose that the parameters 0 = (Oi P ’0 P’k)^ of the equivalent 

difference equation 

/(n) + - 1) + + flfcV (« -A ) = /ffli/f/i) + +/?'Ku(n-A') 

base been computed from samples > (n) and u (n) of >-(t) and u(i) respectively 
Then this difference equation is first rewritten m a state-space form The par- 
ticular form chosen here is 

x(« -K 1) = Ax(n) bti(n) 
y(«) = cx(n) -I- </ii(n) 

where 



0 0 

0 -a, \ 


/ 1 

0 0 

0 -5«-l\ IPk 


A = 0 

1 0 

0 -iTi-i) b =1 


\ 


/ V< 

- “iA/ 

\o 


1 -a, / 



c = (0 

0 1) rl = /)„ 



and 

X (n) = (Jc,(n)’Ci(n) Vjtfn))^ 

Now let the eigenvalues 2j, of A be distinct For the case of multiple 

eigenvalues see Harris (1979) Furthermore, define si as the eigenvector cor- 
responding with Ak Then it can be shown that the eigenvalues 2,, , 2* of the 

A matrix of an equivalent continuous-timedescnption satisf> 

/l, = i|nA, 1 = 1. ,K (821) 


while 


A=SAS-‘ 


(822) 
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where A = diag(Ai, . . . , Ak) and S' is the matrix having the s'^ as columns. 
Moreover, it follows from equation (8.20) that 

b = (A' - I)-‘Ab' (8.23) 

while 

c = (0 • • • 0 1) and d = P'q 

So an equivalent continuous-time description can be obtained from A' and B' 
by first computing the eigenvalues and corresponding eigenvectors of A' and 
subsequently using equations (8.21), (8.22), and (8.23) for the computation of A 
and B. The computation of the scalar differential equation (8.16) from these 
matrices is then straightforward. 

Generally, the eigenvalues are complex. So 

AfcA = In AJ. = InjAfcl -1- j(arg A^ -f- Inn) 

where n = 0, +1, ±2, Hence, if for the imaginary part of A^ A the principal 

value —Tt < Im(ln A^) < ;r is chosen, this is justified only if — rr < A Im A^ < rr, 
thatis,if|Im AkI < n/A,k — 1, ..., K. It is concluded that the sampling frequency 
in rad s"' should exceed twice the largest imaginary part found among the 
eigenvalues A^, k = 1, . . . , X. 

In practice, the above relations between equivalent representations are also 
used if u{t) is not strictly discrete-interval but does not change substantially 
between consecutive sampling instants. The results are then, of course, approxi- 
mations. Finally, it is observed that the computation of the continuous-time 
description from its discrete-time equivalent is relatively complicated and time- 
consuming. In the next subsection it is shown that these computational difficulties 
can be avoided if u(t) is a periodic test signal, since in that case the parameters of 
the continuous-time system can directly be estimated from the observations 
without estimating the parameters of the equivalent discrete-time system first. 


8.4.7 Continuous-time Estimation Using Periodic Test Signals 
Suppose that the problem is the estimation of the parameters 

® ~ (*o ■ ■ * *K- I Po' " Pk)^ 

of the model 


d'^v(0 


~^ + ... + ao)it) = P^ 


dVO 

dr^ 


-t- • ■ • + ^0 


from observations w(t) = y(t) + p(i) and i;(f) = «(£) + q(t) where p{t) and q(t) 
are stationary stochastic processes representing non-systeraatic errors and i/(f) 
IS a test signal periodic with 0. In addition, assume that the observations are 
available for 0 $ t ^ T where T is an integral multiple J of 0 and that y(r) 



374 


HANDBOOK OF MEASUREMENT SaENCE 


IS also periodic with O The latter assumption impbes that transient responses 
have vanished 

The exact Founer coefficients and of u(t) and >(t), defined m Section 
8 4 5, satisfy 

= 0 


where 


^(r) = ao + OiT + + ctK-ir^ ‘ + r*, 

«W = ^0 + ^ir + + 

and r„ = jlnm/Q Then using similar arguments as in Section 8 4 5, one can 
easily show that necessary conditions for a minimum i= (fio 1 Bo Bg)^ 

of the least squares criterion 

■/(!)= z 

I « I 
1*0 

are 

i:i‘i(-rM,-S(-rMJrUy,„=0 k = 0. ,K-l (824) 

1 

and 

ia(-r.,)y:„-fi(-r„,),:,.)ri,y.,. = 0 I: = 0, ,K (825) 

where / = ± 1, ± 2, , ±L, the m, are the harmonic numbers of the harmonics 

taken into consideration and m, = -m_i Moreover, this minimum is unique 
and t = 0 So the equations (8 24) and (8 25) are a set of 2K + 1 linear equations 
in the 2/C + 1 unknown parameters 0 They can, therefore be explicitly solved 
for the exact parameters 0 if the and y„,„ are known which is, a result of the 
errors in the observations, not the case However, if the y„,„ and y„,, in equations 
(8,24) and (8 25) are replaced by corresponding consistent estimators, (8 24) 
and (8 25) become a consistent closed-form estimator of 0 Practical estunators 
for y„, and y„„ are 

1 f-'® 

and defined analogously Levin (1959) has shown that under very general 
conditions with respect to p(t) and g(t)2m, and are unbiased and consistent 
With respect to the proposed estimator of 0 the following remarks can be 
made In the first place, comparing equations (8 24) and (8 25) to equations 
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(8.13) and (8.14) shows that it may be considered to be an instrumental variable 
estimator using estimated differentiated versions of the relevant periodic com- 
ponent of the response and the input as instrumental sequence. Furthermore, 
this continuous-time estimator has the same advantages, computational 
simplicity in particular, as its discrete-time counterpart described in Section 
8.4.5. For a more detailed discussion of the estimator, including numerical 
examples and an evaluation of its precision, the reader is referred to Van den 
Bos (1970, 1974). 


REFERENCES 

Clarke, D. W. (1967). ‘Generalized-least-squares estimation of the parameters of a dynamic 
model’, in Preprints 1st IFAC Symp. on Identification in Automatic Control Systems, 
Academia, Prague. Paper 3.17. 

Cramer, H. (1961). Mathematical Methods of Statistics, Princeton University Press, 
Princeton. 

Davies, W. D. T. (1970). System Identification for Self-Adaptive Control, Wiley, London. 

Draper, N. R. and Smith, H. (1966). Applied Regression Analysis, Wiley, New York. 

Durbin, J. (1960). ‘The fitting of time-series models’. Revue Inst. Int. de Stat., 28, 233-43. 

Eykhoff, P. (1974), System Identification, Wiley, London. 

Fedorov, V. V. (1972). Theory of Optimal Experiments, Academic Press, New York. 

Goodwin, G. C. and Payne, R. L. (1977). Dynamic System Identification: Experiment 
Design and Data Analysis, Academic Press, New York. 

Griffiths, J. W. R., Stocklin, P. L. and Schooneveld, C. Van. (1973). Signal Processing, 
Academic Press, London. 

Harris, E. L. (1979). ‘Using discrete models with continuous design packages’, Automatica, 
15, 97-100. 

Haykin, S. (1979). Nonlinear Methods of Spectral Analysis, Springer, Berlin. 

Hoffmann de Visme, G. (1971). Binary Seguences,The English Universities Press, London. 

IEEE (1974). IEEE Trans., AC-19, No. 6., complete issue. 

IFAC (1967). Preprints 1st IFAC Symp. on Identification in Automatic Control Systems, 
Academia, Prague. 

IFAC (1970). Preprints 2nd IFAC Symp. on Identification and Process Parameter Estimation, 
Academia, Prague. 

IFAC (1973). Proc. 3rd IFAC Symp. on Identification and System Parameter Estimation, 
North-Holland, Amsterdam. 

IFAC (1978). Proc. 4th IFAC Symp. on Identification and System Parameter Estimation, 
North-Holland, Amsterdam. 

IFAC (1979). Proc. 5th IFAC Symp. on Identification and System Parameter Estimation, 
Pergamon, Oxford. 

IMSL (1977). IMSL Library 1, International Mathematical and Statistical Libraries, 
Houston. 

Jenkins, G. M. and Watts, D. G. (1968). Spectral Analysis and Its Applications, Holden- 
Day, San Francisco. 

Jennrich, R. I. (1969). ‘Asymptotic properties of non-linear least squares estimators’, 
Ann. Math. Stat., 40, 633-43. 

Kaufmann, H. C. and Akselsson, R. (1975). ‘Non-linear least squares analysis of proton- 
induced X-ray emission spectra’, Adv. X-Ray Anal., 18, 353-61. 



376 


HASOBOOk OF MEASUREMENT SQENCE 


Kendall. M G and Stuart. A (1966) The Adianced Theory ofStaiisitcs, Vol 3. Isi Ed, 
GnfTin, Lxindon 

Kendall. \! G and Stuart, A (1967} The Adianced Theory of Statistics. \o\ 2, 2nd Ed, 
GnfTin. London 

Kendall. M G and Stuart, A (1969) The Advanced Theory of Statistics, Vol 1.3rd Ed, 
GnfTin. London 

Kwakemaak. H andSitan R (1972) Linear OplimatControl Systems, Wiley, Uc'f.Yotl 
LcMH M J (1959) Estimation of the charactenstics of linear systems in the presence of 
noise. D5r Thesis Electrical Engineenng DepL, Columbia Unisersity 
kfakhoul J (1975) Linear prediction a tutorial rcsiew *, Troc IEEE,63, S61-S0 
Mann H B and Wald, A (1943) 'On the statistical treatment of linear stochastic dif- 
ference equations, Econonie/nea II. 173-220 
Marquardf. D W (1963) An algorithm for least-squares estimation of nonlinear par- 
ameters’ J Soc Indus! Appl t/oift . 11.431-41 
Mood A M.Graybiil.F A.andBoes.D C (1974) introrfi/ctiontotlie T/ieory o/Statutics, 
McGraw-Hill Tokyo 

Murray W (1972) S’urnencal Methods /or t/neonsfruined Opfimiration, Academic Press, 
London 

NAG (1978) A.4G Fortran Library Manual, Mark 6, Numenca) Algorithms Group, 
Oxford 

Norden, R H (1972) ’A sursey of maximum likelihood estimation*. Int Stai Rei, 40. 
329-54 

Norden. R H ( 1973) A sursey of maximum likelihood estimation’. Part 2, Int Star Rer 
41,39 58 

Oppcnheim. A \ and Schafer, R W (1975) Digital Signal Processing, Prentice-Hall 
Englewood Cliffs 

Papoulis,A (1965) Prohabiltii Raniloml'anables.andStochosiicProcesses,McGTa't,’\\i\l 
New York 

Peterson. W W (1961) Error Correermt) Codes. Wiley, New York 
Piccni H A Land EykhofT, P (1975) The useofhybnd computers for system parameter 
estimation’. Ann de fA/CA, 17, no 1.9-22. 

Robinson EA (1967) Staiisriea/ Commumeotion end Z?ererrion. Griffin, London 
Schwartz. M and Shaw, L (1975) Signal Processing Discrete Spectral Analysis, Detection 
and Esrinuirion. McGraw-Hill London 
Seber, G A F (1977) Linear Regression Analysis Wilcy, New York 
Van den Bos. A (1970) ’Estimation of linear system coefficients from noisy responses to 
binary multifrequcncy signals’. Preprints 2nd IF AC Sywp on Idenltficaiton and Process 
Parameter Estimation, Academia. Prague Paper 72. 

Van den Bos, A (1974) 'Estimation of parameters of linear systems using pcnodic test 
Signals’, D Tech Sc Thesis, Delft University of Technology 
Van den Bos. A (1977) Application of stalislica] parameter estimation methods to 
physical measurements’. d Phys E Sci /mtrum, ID, 753-60 
Van den Bos, A (1979) *Small sample properties of a class of nonlinear least squares 
problems', in Proc 5th J FAC Symp onldentificationand System Parameter Estimation, 
Perpmon. Oxford Paper M16 

V’andenBos,A and Krot. R G (1979) *Synthcsis of discrete-intcnal binary signals with 
specified Fourier amplitude spectra', /nf J Control,30 871-84 
Van ^pcn. P , Nullens, H , and Adams. F (1977) ‘A computer analy-sis of X-ray fluor- 
escence spectra’, A uc/ Instrvm Me/A , 142. 243-50 
Van Trees, H L (1968) Dcfcciion, Estimation and Modulation Theory, Part I, Detection, 
Estimation and Linear Modulation Theory, Wiley, London 



PARAMETER ESTIMATION 


377 


Van Trees, H. L. (1971a). Detection, Estimation and Modulation Theory, Part 2, Non-linear 
Modtdation Theory, Wiley, London. 

Van Trees, H. L. (1971b). Detection, Estimation and Modulation Theory, Part 3, Radar, 
Sonar Signal Processing and Gaussian Noise Signals in Noise, Wiley, London. 
Wilkinson, J. H. and Reinsch, C. (1971). Handbook for Automatic Computation, Vol. II: 

Linear Algebra, Springer, Berlin. 

Wilks, S. S. (1962), Mathematical Statistics, Wiley, New York. 

Zacks, S. (1971). The Theory of Statistical Inference, Wiley, New York. 



Handbook of Measurement Science, Vol. 1 
Edited by P. H. Sydenham 
© 1982 John Wiley & Sons Ltd. 


Chapter 

W.J. KERWIN 

Analog Signal Filtering and Processing 



Editorial introduction 

At some stage in the information flow chain of an instrumentation system the need will 
arise to alter the frequency content of the signals. This, and the following chapter, discuss 
how signals in the electric form can be processed: this chapter explains procedures based 
on the analog, linear, signal form; Chapter 10 explains those based on the digital alterna- 
tive. Examples of this need are reduction of unwanted frequencies, such as mains-induced 
noise, bandwidth extension by selective processing of gains of the various frequencies, 
selective attenuation as needed in audio compensation systems, and bandwidth limitation 
for signal detection systems. 

Filtering was, in the main, originally developed to fill telecommunications needs arising 
from the 1880’s onward. By the 1930’s the design of passive network filters had reached a 
level of maturity. The arrival of computers and the inexpensive solid-state semiconductor 
active amplifier element made new and more complex networks viable. Thus developed the 
active filter in which a passive network was combined with an active amplifier. Active 
methods enabled circuits to be devised which did not need the expensive, lossy, inductor 
component and in which the numerical value of capacitors could effectively be multiplied 
up. During the 1970’s electronic filter design emerged from the realm of the circuit theorist 
to become an easy-to-use tool. Design books greatly helped this transition. This chapter 
reviews some dominant design methods in use and, as such, cannot replace the many 
design texts, a selection of which are mentioned. 


9.1 PASSIVE SIGNAL PROCESSING 


9.1.1 Introduction 

With the possible exception of certain applications where extensive optical 
filtering is valuable and cases where very limited spectral processing can be used 
in a more appropriate energy domain (examples are the tuning of mechanical 
systems, the selective insulation of thermal systems) the electric form of signal 
will be preferred as the input to a data processing stage. It is easier to implement, 
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the least expensive alternative, small m size, and can employ very powerful 
filtering strategies 

To begin a design study of a suitable filtering stage it will be necessary to 
first decide the specifying requirements What changes to the amplitude and 
phase of the components of input signal are required over the band of frequencies 
involved*^ This information is generally specified as the transfer function of the 
filter stage This can be realized by inspection of known responses, types, or by 
circuit theory derivation 

Additionally the stage may require uniform gam change (called amplification 
if unity or greater, attenuation if less than unity) It may be essential that phase 
shift IS kept m control or that it is shifted uniformly 

As will be seen m this chapter, the totally general and economic filter design 
does not exist Design involves choice of a selected design that best suits the 
above realized specification 

As a general rule compromise must be made between sharpness of the fre- 
quency roll-off away from the band-pass region and the time domain response 
Fortunately today it is a relatively simple, straightforward, and inexpensive 
matter to implement filter designs at the prototype stage Once tested for general 
suitability they can then be further refined to make them less sensitive to circuit 
component tolerance spreads and temperature dependence 

This much said the chapter begins with some definitions and an example 


Laplace notation 

We are concerned here primarily with signal modification and amplification, 
both steady state and transient We will discuss both passive and active networks 
The Laplace transform notation, s = <7 + jo), will be used throughout To 
differentiate between a steady state problem and a transient problem, we will 
use p *= joj for steady state and s = a -I- ja» for the transient case Impedance is 
then Lp or 1/Cp, and admittance 1/Lp or Cp for an inductor (L) or a capacitor 
(C) 


Transfer functions 

Discussion IS restricted to lumped, linear, finite systems, both passive and active 
Transfer functions will be given as output response over excitation, that is, 

Tip) = signal output 
signal input 

The roots of the numerator of the transfer function (the zeros of the function) are 
therefore the zeros of transmission of the system The poles of the transfer 
function (the roots of the denominator) are the points of infinite system response 
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The steady state magnitude response of the transfer function is easily obtained 
by separating T(p) into real and imaginary parts (p = jco), so as to obtain 


T(p) = 


■^2 + j -®2 


(9.1) 


then the magnitude of T(p) is 



The phase is given by 



(9.2) 


(9.3) 


Network analysis 

The usual mesh or nodal analysis could be used for all of the systems to be 
discussed here; however, the vest majority are ladder networks and much simpler 
analysis methods will therefore be used. The most appropriate method is com- 
monly called linearity. It uses the fact that in a linear network the transfer 
function is independent of the signal level. Thus, a value of 1 volt or 1 ampere is 
usually assumed. In addition, analysis proceeds from output to input. 


Example 9.1. Determine the steady state transfer voltage ratio vjv-, and the 
magnitude and phase of the network in Figure 9.1. 

Let Uo = 1 volt, then 

ia = fp(l) = Ip f = 1 + (fp)(|p) = 1 + 2p2 

h = = ip(l + 2p2) ii = i2 + h 

Vi = V + ii(l) = 1 + 2p^ + 2p + p^ 



1 


1 

p^ + 2p^ + 2p + 1 


I T(jj) _ 2^2)2 + (2m - m^)2] + 1) 


I(p)p=ja, = 0° - tan 1 



The above example used normalized element values (near or at unity fi, H, F) 
and has a cut-off frequency ( — 3 dB) which occurs at co = 1 rad s“ as can be 
seen from the magnitude expression |T(p)|„,^i = l/y/2. Note also that 
l^(p)L-.oo -*• 0 and has a magnitude of 1 at d.c. so that it is a low-pass filter. 
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Figure 9 I nurd order, low 
pass filter (units in iX H, F) 


Since the numerator of T(p) is a constant, there are no finite zeros and since the 
denominator is a cubic, there are three poles and the system is of third order 
In a later section we will discuss scalmg procedures to obtain any desired source 
impedance and cut off frequency In addition, we will transform low pass 
prototype filters to high pass or band-pass filters 


9.1.2 Low-pass Filter Functions 
Butterworth function 

The Butterworth function (Butterworth, 1930) is one of a class of functions that 
have maximally fiat magnitude (MFM) In addition to the MFM property, 
they are all pole functions (i e no finite zeros) The MFM function is one that 
has as many denvatives equal to zero as possible at cu = 0 A much simpler way 
of defining and denving the Butterworth polynomials is to realize that this 
definition is satisfied if the term by term magnitude coefficients of the numerator 
and denominator polynomials have the same ratio as exists at cti = 0 For the 
Butterworth this means that all o coeffiaents in the magnitude expression 
except the highest one are equal to zero In addition, all Butterworth functions 
are normalized so that the cut-off frequency a>_ 3 dB = 1 rad s~^ 

Example 92 Fmdthevaluesofthecoefficientsforathird order Butterworth 
function having unity gam at d c 
Let 

ap^ + bp^ + cp + 1 

thus 

I 7’Cp) I ^2^6 0^2 _ 2ac)(o* + (c^ — 2b)(0^ + 1 

therefore 

— 2ac = 0 c* — 26 = 0 
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and since then 


\T(p)\^ = 


1 

+ 1 


1 

2 ’ 


that is, at CO = 1 the function is down 3 dB. Therefore a = 1 so that b — 2, 
c — 1 and 

+ 2p^ + 2p + 1 


Depending on the design problem, we may be working directly from the 
polynomial coefficients or from the pole positions so that both forms are of 
value. The normalized pole positions are given by: (a) they are on a unit circle; 
(b) they are all 180°/n apart; (c) there is a single real pole at — 1 for all odd order 
functions and complex pairs only for all even order functions; and (d) they are 
in the left half plane. 

The magnitude response of all normalized Butterworth functions 
(unterminated) is given by 


thus 


or 




1 

0 )^" + 1 


(9.4) 


>l(dB) = 20 log 




(9.5) 


0 ) = ^( 10 "'*/^® - 1 ) 

log(10-^^^° - 1) 

n = — 

2 log w 


(9.6) 

(9.7) 


so that given .4 at a certain co, the required n can be determined. 


Example 9.3. A magnitude of —40 dB is required at cu = 3.5 rad s ^ for a 
filter with a cut-off frequency of 1 rad s~^. What order Butterworth filter is 
required? 

In this case, 


log(10‘'°/^° - 1) 
” ~ 2 log 3.5 

Thus, a fourth-order filter will be required. 


3.676 
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Table 9 1 Buttcnvorth denominator polynomials 
P+ 1 

p* + py/2 + 1 
+ 2p* + 2p + 1 

p* + 2613l3p* + 34142Ip^ + 261313p+ 1 
p’ + 3 23607p* + 5 23607p^ + 5 23607p^ + 3 23607p + 1 


The Butterworth function is one that will normally be used when flat 
magnitude is most important (without ripple), and where linearity of phase or 
freedom from transient overshoot to a step input is of lesser importance The 
slope approaches 6n dB/octave as to oo A few Butterworth polynomials are 
given in Table 9 1 

Thomson functions 

The Thomson function (Thomson, 1959) is an all pole function whose time 
delay IS maximally flat (MFD) The time delay is defined as TD = — d(^(to)/d(a 
where 0(to) is the steady state phase These functions can be derived in a similar 
manner to that used for the M FM functions after first finding the phase and then 
differentiating before applying the maximally flat procedure previously shown 
for the MFM functions These functions are frequently called linear phase 
functions 

The usual applications are in those systems where linearity of phase or lack 
of overshoot or oscillation in response to a step input is of primary importance 
Their magnitude response is a gradual decrease m amplitude to the “3dB 
point and then an increasing slope until 6n dB/octave is obtained 

Storch (1954) has given a simple method of finding these polynomials (also 
called Bessel polynomials) Table 9 2 gives the first five MFD denominators, and 
their —3 dB frequencies (all are normalized to unity time delay) 

Chebyshev functions 

The Cbcbyshev function is similar to the Butterworth and Thomson functions 
in that It IS an all pole function It is defined m terms of the steady state magnitude 
such that the magnitude varies between tolerance limits throughout the pass 
band These limits have equal maxima and minima and so the function is 
referred to as an equal ripple magnitude function The specification is the 
peak-to-peak magnitude of the ripple in dB These functions are very efficient 
in that they offer a faster transition from the pass band to the stop band than the 
previous functions, and, of course, maintain a specified tolerance in the pass 
band The disadvantage is a highly non-linear phase change with frequency 
worsening as the ripple increases, thereby seriously worsening the transient 
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Table 9.2 Thomson denominator polynomials (MFD) 
and ft>- 3 dB (in J'nd s' ') 


p + 1 1.0000 

p^ + 3p+3, 1.3617 

+ 6p2 + 15p + 15 1.7557 

p^ + 10p2 + 45p^ + 105p + 105 2.1139 

p= + 15p^ + 105p3 + 420p^ + 945p + 945 2.4274 


response. These functions have a high degree of overshoot to a step input and 
are highly oscillatory as they return to final value. 

Since the functions are dependent on the amount of ripple, it is necessary to 
tabulate them as a function of the specified ripple. It is customary to specify the 
pass band as the end of the equal ripple band and to normalize this to 1 rad s~ ^ 
Since the functions change as the degree of ripple changes, it is usual to tabulate 
a few specific ripple magnitudes. Here we will specify the polynomials in terms 
of the ripple and thus one set of equations will cover all possible ripples. 

Figure 9.2 shows an odd and an even Chebyshev function with a magnitude 
of one at CO = 0, as it would be in an unterminated filter, and the definition of e. 
Note that 20 log[,y(l + a^)] = zi(dB) of ripple. A is positive in all cases. 
Table 9.3 lists a few Chebyshev polynomials as a function of v, normalized to 
CO = 1 rad s"^ at the end of the ripple, v is defined in terms of the ripple 
magnitude A, for a given order n, as 



(9.8) 

1 

o 

O 

> 

11 

(9.9) 


To determine the attenuation at any particular stop band frequency, we have 


\TQco)\^ = 


1 

1 + cosh^(n cosh' ^ co) 


(9.10) 




Figure 9.2 Chebyshev magnitude versus frequency 
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Table 9 3 Chcbjshe\ po!>'nomtaIs 
p + smh r 

;>* + (.^ 2 smh t)p + smh^ f + i 
(/* + smh t)[/> + (smh i)/» + sinh^ t- + j] 

[p^ + (0 76537 smh r);» + smh^ t + 085355] [/*“ +(| 84776 smh i)p + smh^ t + 0 14645] 
(/I + smh + (061803 smh i)/» + smh* t + 0 90451] 

X £p’ + (1 61803 smh i)p + smh* t 4- 034549] 


The — 3 dB frequency is also of interest for the Chebyshev functions and is 



where c is defined by equation (9 9) 

Example 9 4 What order Chebyshev function (normalized) is required if the 
ripple -4 IS 0 1 dB and the attenuation is to be 60 dB at co =* 4 rad s" ‘ What is 
the -3 dB frequency'’ 

Solving for c (equation (9 9)) 

e = ^(10'* '®- I) = 015262 

Therefore from equation (9 10). c as above, and w = 4 rad s" ‘ 

-60 dD = 20 locH i i i— V ^ 

^Lv «*cosh*(n cosh"* oj)/ J 

Solving for n, we get /I = 4 595 Thus.n = 5 is required From equation (9 II). 
we have 


CU_ jjB 



1 135 rads"* 


Example 9J Determine the denominator polynomial for a fourlh*ordcr 
Chcbjshci (n = 4) of 003 dB ripplc(ic A = 003 dB) 

c = VOO'' 0 = 0083257 

From equation (9 8),i = (l/n)sinh"’(l/e) = 0 795175 and smh i> = 0S8I663 
From Tabic 9 3 the Chcbjshcv polynomial is then 

(p= + 06748p + I 6309Xp’ + 1 6291p + 09238) 
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Inverse Chebyshev function 

The inverse Chebyshev function has an MFM pass band and an equal ripple 
stop band produced by finite complex conjugate jco axis zeros. In this case, the 
MFM characteristic is obtained by making the ratio of the numerator coefficient 
to the denominator coefficient in the magnitude expression (term by term) equal 
to the value of | T(p) | at c« = 0. All inverse Chebyshev functions are normalized 
to a)_ 3 dB = 1 rad s“ T The finite jco axis zeros allow a much steeper roll-off and 
therefore greater discrimination; however, a stop band return occurs following 
the zero or zeros. 

A general form of a 22-3p function having no d.c. loss is; 


T(p) = 


dp^ -t- 1 

ap^ + bp^ -h cp -f 1 


(9.12) 


For this to be a normalized inverse Chebyshev function, we have three con- 
straints: (a) MFM; (b) co_ 3 dB = 1 rad s"*; and (c) specification of the finite 
zero position, oj_„. 

Finding the square of the magnitude of T(p) (equation (9.12)): 


Thus 


\T(j>)\Ui. 


— 2d(X)^ -t- 1 

ci^co® + (b^ — 2ac)a)'* + (c^ — 2b)(X)^ -f 1 


(9.13) 


Oi-co = or d ~ l/col^ (9.14) 


Equating coefficients (since ( T(p)\^^o = 1) to obtain MFM, we get 


— 2b = —2d 
b^ — 2ac = d^ 


at CO = 1 rad s ^ at which | T(p)p = j (—3 dB), we have 

d^ -2d + 1 1 


lT(p)|^ = 


+ d^ -2d + \ 2 


a = \ — d 


(9.15) 

(9.16) 


(9.17) 


then substituting a = 1 — d into equation (9.16) and solving equation (9.15) and 
equation (9.16) simultaneously, we find 

b^ - 2(1 - d)V(2h - 2d) = d^ (9.18) 

Thus a specification of the desired d (from (u_„) allows b to be determined, 
as well as a from equation (9.17) and c from equation (9.15). A minimum 
practical d exists which depends on the network used to realize this function, 
but is not a limitation. 
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Example 9 6 Determine the 2z~3p m%erse Chebyshev function having zero 
output at 3 0 rad s" ' (i e o_*. = 3 rad s“ *) 

We ha\c 

d = l/oioo = i 

thus 

a= 1 -d=| 

and from equation (9 18) 

- 2(i)jab - i) = (i)= 

This IS easily sohed using a programmable hand calculator, since we want only 
positi\e real roots Thus b ~ 1 8150, and from equation (9 15) 

c = ^(2b - ^) = 1 8450 

and the function is 

7',(P) = 


fp' + 1 8l50p' + I 8460p + I 


Similarly, other values of the zeros can be specified We can also set up 
2r - 4p, 2r - 5p, , inverse Chebyshev functions, as will be discussed shortly 

The more nearly the zero position approaches 1 rad s'\ the higher is the 
peak return m the stop band as shown in Figure 9 3 
For the curve shown with a zero at <a = 2 rad s” *, the transfer function is 


' ip^ + I 5856p^ + 1 6344p + 1 


The magnitude at the peak m the slop band (at to„„) is given by equation 
(9 20) for all unterminated 2z~3p inverse Chebyshev functions 


|7'(f’)L,«..(dB) = 


r (27/4)<ol,(a>l« 



Figure 9 3 Inverse Qjcbyshcv responses 
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The frequency of the peak stop band return is 


®^raax (^—co \/3 


(9.21) 


This function can be set up so as to remove a specific frequency or to give the 
best cut-off slope consistent with a specific stop band minimum attenuation as 
determined by equation (9.20). 

A 2z-4p function can be set up as 


having 


ap* + bp^ + cp^ + dp + I 
o>-ao = V(V^) or ^ = l/<yico 


(9.22) 

(9.23) 


Now finding the magnitude of equation (9.22) and equating the corresponding 
numerator and denominator coefficients, as before, as well as setting = 

1 rad s~^, we obtain the following results; 


a = l - e (9.24) 

4- 4d^e - Sd^[_2(l - e)(d^/2 -f e)] + 8(1 - c) = 0 (9.25) 

c = dV2 + e (9.26) 

b = V[2(l - e)c] (9.27) 


Example 9.7. Determine a 2z-4p inverse Chebyshev having (w_„ = 2 rad 
From equation (9.23) 

e = 1/co^co = i 

then, from equation (9.25) 

d^ + d^- Sdy/l^dyi i)] -f 6 = 0 

This is easily programmed and iterated to find the real positive values of i which 
are, d = 0.94048 and d = 2.25390. The higher value of d is the correct one in all 
cases, since it gives the polynomial with left half plane poles. Then, from equation 
(9.26), c = rf^/2 + 5 = 2.7900, and from equation (9.27), b = ^[2(1 — i)c] = 
2.0457, and from equation (9.24), a = 1 — e = |. Thus 

T(v) = 7^ - - 

-b 2.0457p3 -t- 2.7900p2 4- 2.2539p 4- 1 


This allows the determination of any 2z-4p inverse Chebyshev function based 
on a specification of aj_ „ only. For the 2z-4p function the stop band maximum 
occurs at 


®max UJ-oo-\/2 


( 9 . 28 ) 





IlWDBOOk OF MEASURE-MEVr SaEVCE 


and the magnitude at that point is 

1 = 20 ^ - 0^ )' ] 

For the prcuous example for o. « = 2 rad s~ = 2^2 = 2 8284 rad s~ 
and the magnitude at is —3363 dB For the 2r-3p function having 
tj_, = 2 rad s' *. the peak return was —2387 dB In addition, the magnitude 
of the slope following the peak reaches I2dB/octa\e as ru -► oo, whereas the 
2z-}p has an ultimate slope as <j -* » of only 6 dB/octave 
The detailed derivation of the 2e-5p inverse Chcb>she\ function will not be 
presented here, however, a verj useful result is the = 2 rad s'* function 

It IS 

f, V ip* * 

" ip' + 25I5V + 4219V + 43975;)' +28S01p+ 1 

for which 

= 7(5/3) (9 31) 

and 

ITWU™ = [l + - 1)=]’"' <”2) 

For the 2z-$p function of equation (930), a>m„ « 2.5820 rad s" ‘ and 
lT(p)l^^*s -4222 dB Of course, the ultimate slope as w -* oo is improved 
to 18 dB/octave in this case 

A comparison of the magnitude responses versus frequency of all of the 
functions presented above is given in Figure 94 Functions are scaled to 
o.j = I rad s' ‘ for the third-order case 



figure 94 Magnitude response versus frequency for third* 
Older filters 
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9.1.3 Low-pass Filter Design as the Basis 

The element values given here will be restricted to the two most important cases; 
the unterminated filter and the filter with equal terminations (R^ = i?L = 1 
We begin by study of design of normalized, low-pass procedures, extending 
these designs to whatever scale or frequency pass type is needed. 

Buttenvorth Filters 

For the terminated filter, the element values are given in Figure 9.5 for n = 2,3, 
4, and 5. Values are in (ohm), H (henry), F (farad). All cut-off frequencies are 
to_ 3 dB = 1 rad s~K 

In the unterminated case for Rs= 1 Q, Fl = oo, the element values are given 
in Figure 9.6. Values are in Q, H, F; again = 1 rad s" ^ 

Thomson filters 

For the terminated filter, values are given in Figure 9.7 for n = 2, 3, 4, and 5. 
For the unterminated case, the values are shown in Figure 9.8. All filters are 
normalized to one second time delay. All values are in fi, H, F; see Table 9.2 
for the value of (U_ 3 <ib. 


1 V2 12 



Figure 9.5 Equal termination Butterworth filters (units in H, F) 



Figure 9.6 Unterminated Butterworth filters (units in Q., H, F) 
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Figure 9 7 Equal termination Thomson filters (units Q, H F) 


Chebyshev filters 

Since the amount of npple can be specified as desired, the number of possible 
Chebyshev filters is doubly infinite A few Chebyshev filters are given in Figure 
99 All are normalized to end of ripple at to » I rad s“', all values are in Q, 
H.F 


/niersc Chebyshev filters 

Again we have a doubly infinite set of possible fillers since the stop band ripple 
can be specified as well as the value of n We will tabulate a few values of 
for the 2z-3p, the 2s 4p, and the 2z-5p fillers All have w.jdB * 1 rads * 
All values are in ft H, F is the frequency of maximum response in the slop 

band as shown m Figure 9 3 The schematic for the Iz-ip is shown m Figure 
9 10, and the element values for the 2r-3p filter are given in Table 94 The 
terminated filters are — 6 dB at a> =* 0, whereas the untermmated are OdB at 
G) = 0 m all cases 

The schematic for the 22-4p inverse Chebyshev filter is shown in Figure 9 U, 
and the element values for a few values of a>_«, are shown in Table 9 5 All values 
are in ft H, F 



Figure 9 8 Untermmated Tliomson filters (units in fl H,F) 
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c* OQ rhebvshevfillerelemcnlvalues(unitsinfi,H,F):(a)0.10dB 



Figure 9.10 2z-3p inverse Chebyshev filter 
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Table 9 4 2z 3p in\erse Chebyshev filter element values {in H, H, F) 


CJ <, 


|T(P)I^. (dB) 


c, 



c, 

2 

3 4641 

-2987 

1 

08172 

16344 

01530 

08172 

2 

3 4641 

-23 87 

CO 

02556 

09687 

02581 

1 3787 

3 

5 1962 

-4190 

1 

09230 

1 8460 

006019 

0 9230 

3 

51962 

-35 90 

00 

04013 

I 1794 

009421 

14447 

4 

69282 

-49 86 

1 

09574 

1 9149 

003264 

09574 

4 

69282 

-4386 

00 

04461 

12482 

005007 

14688 

5 

86602 

-5588 

I 

09730 

19459 

002056 

09730 

5 

8 6602 

-4988 

00 

04659 

12793 

003127 

14800 



Figure 9 11 2z 4p inverse Chebyshev filter 


The schematic for the 2r 5p inverse Chebyshev filter is shown in Figure 9 12 
and the element values for a few values of o>,<o are shown in Table 9 6 


Scaling la\isand a design example 

Since all the data previously given are for normalized filters, it is necessary to 
use the scaling rules to design a low.pass filter for a specific signal processing 
application 

Rule I All impedances may be multiplied by any constant without affecting 
the transfer voltage ratio 

Rule 2 To modify the cut-off frequency, divide all inductors and capacitors by 
the ratio of the desired frequency to the normalized frequency 


Table 9 5 2z-4p inverse Chebyshev fitter element values (in n, H, F) 


to.. 


ir(p)i^„(dB) 

Rl 


Cl 

1^2 

C2 

Cj 

2 

2 8284 

-3963 

1 

07350 

1 6723 

1SI89 

01646 

05816 

2 

2 8284 

-33 63 

oo 

03666 

08649 

12338 

02026 

1 3890 

3 

42426 

-55 19 

1 

0 7500 

1 7703 

I 7121 

0 06490 

06918 

3 

42426 

-4919 

00 

03761 

09928 

14327 

007756 

14693 


56568 

-65 65 

1 

07570 

1 8050 

17726 

003526 

07247 

4 

56568 

-59 65 

CO 

03791 

10332 

14973 

004174 

1 4965 

5 

70711 

-73 60 

1 

07600 

18205 

1 8001 

002222 

07396 

5 

70711 

-6760 

CO 

03804 

10512 

1 5265 

002620 

15089 
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Figure9.12 2z-5p inverse Cheby shev filter 


Table 9.6 2z-5p inverse Chebyshev filter element values (in Q, H, F) 



^max 

|T(p)L„JdB) 

c, 

L, 

a 

L2 



2 


-42.22 

0.2981 

0.8649 

1.1824 

1.3678 

0.18278 

1.3996 

3 


-61.30 

0.3045 

0.8824 

1.2991 

1.5568 

0.07137 

1.4828 

4 


-74.26 

0.3066 

0.8878 

1.3364 

1.6184 

0.03862 

1.5105 

5 

1 

-84.16 

0.3075 

0.8902 

1.3530 

1.6461 

0.02430 

1.5231 


Example 9.8. Design a low-pass filter of MFM type (Butterworth) to operate 
from a 6(X) D source into a 600 load, with a cut-off frequency of 500 Hz. The 
filter must be at least 36 dB below the d.c. level at 2 kHz, that is, —42 dB. 

Since 2 kHz is four times 500 Hz, it corresponds to a> = 4 rad s~Mn the 
normalized filter. Thus at co = 4, we have 

therefore, it = 2.99, so n = 3 must be chosen. Thus a third-order terminated 
Butterworth is required. From Figure 9.5 we have the normalized network 
shown in Figure 9.13a. 

The impedance scaling factor is 600/1 = 600 and the frequency scaling factor 
is 27 i 500/1 = 271500, that is, the ratio of the desired radian cut-off frequency to 
the normalized cut-off frequency (1 rad s“^). Note that the impedance scaling 
factor increases the size of the resistors and inductors, but reduces the size of the 
capacitors. The result is shown in Figure 9.13b. 


(a) (b) 



Figure 9.13 Third-order Butteiworth, low-pass, filter; (a) normalized, 
(units in Q, H, F); (b) scaled, (units in Q, H, pF) 
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Transformation rules 

All information given so far applies only to low-pass filters, yet we frequently 
need high-pass or band-pass filters in signal processing (Weinberg, 1962) 

(а) LoH-pass to high-pass To transform a low-pass filter to high-pass, we 
first scale it to a cut-off frequency of 1 rad s”*, if it ts not already 1 rad s“^ This 
allows a simple frequency rotation about I rad s"’ ofp -» 1/p All L’s become 
Cs, all C’s become L’s, and all values reciprocate The cut-off frequency does 
not change 

Example 9 9 Design a third-order high-pass Butterworth filter to operate 
from a 600 D source to a 600 Q load and having a cut-off frequency of 500 Hz 

Starting with the normalized filter of Figure 9 5 for which 3 = 1 rad s“ *, 
we reciprocate all elements and all values to obtain the filter shown in Figure 
9 14a, for which to -3 «= 1 rad s“^ Now we apply the scaling rules to raise all 
impedances to 600 fl and the radian cut-off frequency to 2n500 rad s" ^ as shown 
in Figure 9 14b 

( б ) Loiv-pass to band-pass To transform a low-pass filter to a band-pass 
filter we will first scale the low-pass so that the cut off frequency is equal to the 
bandwidth of the normalized band-pass filter for which coq - 1 rad (a>o ts 
the centre frequency of the band-pass filler) Then we apply the transformation 
p-*p + 1/p For an inductor 

2 ^ Lp transforms to Z = L(p + 1/p) 

For a capacitor 

y « Cp transforms to Y — C(p + 1/p) 

The first step is then to determine the Q of the band-pass filter where 


(/o is the centre frequency and B is the 3 dB bandwidth in Hz), scale the low- 
pass filter to a cut-off frequency of X/Q rad s“ then series tune every inductor 


1 '2 600 0,2653 

- ryTT) - -sSzi- 


Figure 9 14 Third-order Butterworth, high pass, filler (a) normalized 
(units in tl, H, F), (b) seated (units in ti h, pF) 
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Figure 9.15 Low-pass filter (units in D, H, F) 


L with a capacitor of value 1/L and parallel tune every capacitor C with an 
inductor of value 1/C. 

Example 9.10. Design a band-pass filter centred at 100 kHz having a 3 dB 
bandwidth of 10 kHz starting with a third-order Butterworth low-pass filter. 
The source and load resistors are each to be 600 D. 

The Q required is 


100 kHz 
10 kHz 


10 


or 



Scaling the normalized low-pass filter of Figure 9.15a to co_ 3 dB = l/Q = 
0.1 rad s" *, we obtain the filter of Figure 9.15b. Now, converting to band-pass 
with (Oq = 1 rad s"^ we obtain the normalized filter of Figure 9.16a. Next, 
scaling to an impedance of 600 Q. and to a centre frequency of /o = 100 kHz 
(coo = 27tl00k rad s“ *), we obtain the filter of Figure 9.16b. 


(a) (b) 



Figure 9.16 Band-pass filter (Q = 10): (a) normalized, Wq = 1 rad s~‘ 
(units in Q, H, F); (b) scaled, /g = 100 kHz 


9.2 ACTIVE SIGNAL PROCESSING 


9.2.1 Introduction 

The addition of active elements can be done in two ways: (a) as a buffer or gain 
element only; or (b) to provide feedback so as to eliminate the need for induc- 
tance and still be able to realize the functions we previously discussed (passive 
RC circuits cannot realize any of those functions beyond first order). This 
second category of networks is called active RC. 
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922 RLC Sjnthesis b) Buffer Isolation 

By factoring a transfer function into quadratic terms and first-degree terms as 
needed and isolating the sections by buffer amplifiers, the synthesis problem 
becomes trivial as is shown by the next example It is, in fact, synthesis by 
inspection This is useful when we wish to realize functions not covered in the 
preceding sections 


Example 911 


Design a network realizing 


np) = 


' P + ^ \( jp \ 
y + p+ 4/[p‘ + ^p + lj 


This can be split into two sections, first. 


and, second 


Tf )- P + ^ I + 3/p 

p^ + p + 4 p -b 1 -b 4/p 

T rnl — i 

'^^~P^+ip + 2 P + 1+2/P 


Note that all quadratics when divided by p have terms which are inductise, 
resistne, and capacitive in that order, and we can realize each of the above 
factors with a simple L section voltage divider by inspection as m Figure 9 17 
By separating the two sections with a buffer amplifier, we prevent interaction 
between the sections which would otherwise change the transfer function 


Note that the numerator of the second factor is a resistor of ^ D, and the corre- 
sponding denominator term was also | D If any numerator term were larger 
than the corresponding denominator term, we would obtain a negative result 
when the numerator was subtracted from the denominator to obtain the senes 
branch of the L section, but since we are using buffer amplifiers, we can divide 
the numerator by any arbitrary number so that this does not occur and then use 
a corresponding, buffer amplifier gam to exactly achieve the desired transfer 



Figure 9 17 RLC synthesis using buffer 
amplifiers (units in fi. H. F) The transfer 
funaion is given by 


T(P) = 


i—) 

r, Vp + I -b Alp) Vp -I- i + 2/p/ 
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function. Another highly useful result using this method of synthesis is that 
different impedance scaling factors can be applied to the separate sections 
without affecting the result. Frequency scaling factors must, of course, be the 
same for all sections. 

This method is not the most economical of elements, but this is of little 
consequence in the early stages of the design of a measurement system. After the 
design is complete and it has been determined that the particular filter is the one 
required, more sophisticated design methods can then be used to save elements 
if desired. 


9.2.3 Active Feedback 


Introduction 


A very large number of active feedback synthesis methods have been developed 
in order to eliminate the need for inductance in filter networks. Each method 
has applicability to certain frequency ranges or to a particular pole Q. For a 
quadratic factor + y, the pole QisQ = For many networks the 

(2 sensitivity to amplifier gain K defined as 


S^ = 


QdK 


(9.33) 


is a strong function of Q, thereby limiting the application of those networks to 
low-Q systems. In addition, some networks have a passive element sensitivity 
problem which also limits their applicability to low-Q designs : 


QdR ^ QdC 


(9.34) 


To simplify this very large field (Kerwin et al., 1972), we will consider only three 
active RC configurations: first, networks suited to low-g applications (the Q 
of the poles of most low-pass filters is quite low) that use low-gain voltage 
amplifiers, so that operation is possible over a very wide frequency range, 
second, a network suited to high-Q applications, third, a single amplifier 
network that provides jco axis zeros. 


Low-gain second-order low-pass active RC filters 

An early class of active RC filters (Sallen and Key, 1955) used individual quad- 
ratic RC sections with feedback from a voltage amplifier as shown in Figure 
9.18. The transfer function is given by 

K/C 

p^ + (1.1/C + 1 - K)p 4- 1/C 


T(p) = 


(9.35) 
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Figure 9 18 Second-order active 
RC low pass, filter (units in fi. F) 


As can be seen, the amplifier gam reduces the p coefficient by direct sub- 
traction and when this coefficient becomes less than 2 (for C — 1 F), we have 
complex poles and therefore can duplicate an RLC network of the same order 
These structures have high sensitivity to amplifier gam change From the 
transfer function of equation (9 35X wc can determine that 


50 = 


K 

ll/C -hi - K 


(936) 


and thus for C = 1 F, Q = 10, K = 2 0 we have Sf = 20, that is a 1 % change 
m K produces a 20 % change in Q ' We must therefore restrict use of this network 
to loW'Q systems 


Example 9 12 Using the positive-gain active RC structure of Figure 9 18, 
design a fourth-order low-pass Chebyshev filter of 0 03 dB npple, and determine 
the d c gam and the Q sensitivities to amplifier gam change 
We derived the fourth-order Chebyshev polynomial of 003 dB npple in 
Example 9 5 and in factored form, it is 

(p^ -h 0 6748p 4- I 6309KP* + 1 6291p + 09238) 

Thus, we find for the first factor 

1 -K, = 0 6748 += 16309 

Cl Cl 

C, =06132 F Kt= 21191 

and for the second factor 

1 1 1 

— + 1 - Kj = 1 6291 — = 0 9238 

Cj Cz 

C2 = 10825F JCj =03871 

Cascadmgthese two networks, weobtainthecompIetefourth-orderChebyshev 

shown m Figure 9 19 

This normalized fourth-order Chebyshev low-pass filter can now be scaled 
in impedance and frequency as desir^ The normalized cut-off frequency is 
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Figure 9.19 Fourth-order Chebyshev active filter having 
0.03 dB ripple (units in D, F) 


1 rads ^ The d.c. gain is (2.1191)(0.3871) = 0.8203, and the sensitivity to 
changes in gain of the first operational amplifier is 


5 ^ 

1.1/Ci + 1-K^ 


= 3.14 


and for the second is 


I.I/C2 + 1 - K2 


= 0.24 


Higher-order single-amplifier low-pass filter 

Higher-order transfer functions can be obtained with a single amplifier. The 
most useful are the third- and fourth-order functions as shown in Figures 9.20 
and 9.21 (Aikens and Kerwin, 1972; Kerwin et al, 1972). 


State variable second order 

The state variable active RC synthesis method was developed in 1967 (Kerwin 
et al., 1967). It is a more complex structure requiring three amplifiers for a low- 
pass, band-pass, or high-pass second-order structure and four amplifiers for a 



Figure 9.20 Third-order active RC, low- 
pass, filter (units in fl, F) 



c, 

C 2 

C 3 

Butterworth 

1.7058 

0.8671 

0.6761 

Thomson 

0.7064 

0.3100 

0.3046 

0.5 dB 
Chebyshev 

2.2932 

0.9940 

0.6130 
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Figure 9 21 Fourth order active RC, low-pass, 
filter (units m H F) 



Cl 

Cj 

Cj 

c* 

Butterworth 

1 1746 

23041 

0 8519 

04338 

Thomson 

03391 

07413 

0 2500 

01516 

05dB 

Chebyshev 

19860 

30640 

10367 

04181 


complete biquadratic function The reason for considering such a complex 
structure is that the sensitivity to both active and passive elements is very low 
(all < 1 and independent of the Q) In addition, the minimum number of capaci- 
tors IS used (2) even when a complete biquadratic is required A normalized 
schematic is shown in Figure 9 22 for the three-amplifier structure 
Even though all three types of filters are obtained simultaneously, the primary 
use of this structure is for band-pass filters since that is when we need low-Q 
sensitivity The band-pass output transfer function (assuming ideal operational 
amplifiers), the Q, and the centre frequency o)o are 


T(p) = 


Ml + R)\ 

( ’’ 1 

\ l+Ri I 

Vp" + [(1 + fi)/(l + Rj)]p + RJ 


(1 -F 

I +R 


(937) 

(938) 


tUo = v/f? (9 39) 

As can be seen no subtractions exist, which was the cause of the high sensitivity 
m the previous method These structures can easily be cascaded for higher-order 
transfer functions 



Figure 9 22 State variable second-order filter in normal 
ized form (units in 12, F) 
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Example 9.13. Design a second-order band-pass filter having a Q = 50 at a 
centre frequency of 10 kHz using the state variable network. Use a minimum 
resistor value of 10 kD. What is the centre frequency gain? 

From equation (9.39), we normalize to coq = 1 rad s“ ^ : 

cOq = y/R = 1 R = IQ 
and from equation (9.38), for i? = 1 Q, 

<2 = ^-^=50 J?2 = 99n 


The centre frequency gain is 


j?,(i + R)n + RA 

1 +R 2 \ 1 + 1? j 


= -99 


Scaling to 10 kfl, we have a scaling factor of 10k, and from oiq = 1 rad s“ ^ to 
coq = 2nl0k, we have a frequency scaling factor of 27rl0k. The result is shown in 
Figure 9.23. 


The amplifier gain must be at least twice the Q required at the operating 
frequency, and must be much greater than twice the Q if the value of R 2 calculated 
is to be correct. If we used an operational amplifier having a gain of 10^ and a 
-3 dB frequency of 10 Hz, then at 10 kHz the gain would be 100 or just barely 
adequate for the above example even if we removed the 990k resistor. So we can 
see that this method is restricted to primarily the audio frequencies, unless we 
use high performance operational amplifiers. 


jo) Axis zeros 

To design filters such as the inverse Chebyshev, we must have jco axis zero 
capability. The most commonly used RC method for jw axis zeros is the twin-T. 
By providing appropriate feedback and network loading a pair of jm axis zeros 
and a pair of independent complex poles can be obtained (Kerwin and Huelsman, 
1966). The schematic is shown in Figure 9.24 for the case where the zeros are 



Figure 9.23 State variable second-order filter of 
Example 9.13 
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Figure 9 24 
K - 

Tip) = 


2z-2p network (units in Cl, F) 


10/31 


I, a \p^ + 0P + i) 


beyond the poles (greater radial distance from the ongm) The transfer function 
IS also shown m Figure 9 24 

Example 914 Design an inverse Chebyshev 2z-4p active RC low-pass 
filter with a cut-off frequency of 5 kHz and zero output at 10 kHz. Scale the 
impedance level by 5k 

Using the T(p) from Example 97, clearing of fractions, and factoring, we 
obtain 


r(p) = 


-I- 4 

(3p^ -h 6 1293p + 3 8375KP* -F 0 6845p + 1 0423) 


This function has a cut-off frequency of 1 rad s~‘ and a zero at 2 rad s~‘ When 
scaled by 27r5k m frequency, we will have a cut-off frequency of 5 kHz and a 
zero at 10 kHz as required 

We will realize the 2z-2p network first, and factonng out we have 


T,(p) 




p^ + 4 


2043IP-F 12792^ 


7 3 + Pp + yj 


With these values of a, p, y and from Figure 9 24, we obtain the network shown 
in Figure 9 25 There is a multiplier of (y/a) /C = 0 3293, but this has no effect 



Figure 9 25 2z~2p RC network (units m 

an 
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Figure 9.26 Complete 2z-4p inverse Chebyshev filter (units in Q, F) 



Figure 9.27 Scaled 2z-4p inverse Chebyshev filter 


on the pole and zero positions. Now we must realize the second factor, T 2 {p) 
where 


+ 0.6845P + 1.0423 

and cascade it with Ti(p) to realize T(p). Since we are using building blocks in 
which the amplifier is at the output, it does not matter which network is placed 
first. Using the network of Figure 9.18, we find 

1/C = 1.0423 C = 0.9594 
^ + 1 - iC = 0.6845 K = 1.4620 

Cascading these two networks as shown in Figure 9.26, we obtain the complete 
T{p). Note that the d.c. gain achieved is (1.0296)( 1.4620) = 1.5053. 

Scaling the impedance upward by 5k and to a cut-off frequency of 27r5k rad s" ‘ 
(5 kHz), we obtain the network shown in Figure 9.27. 


Low-pass to high-pass transformation 

We cannot use the low-pass to high-pass transformation since we would obtain 
an RL network, not an RC network. If, however, we multiply all impedances by 
P (the impedance scaling rule tells us that we can multiply by any constant, real 
or complex), we obtain an RL network which when transformed to high-pass 
gives us the RC high-pass we want. This is illustrated in Figure 9.28. We start 
with the normalized low-pass filter (Buttenvorth, (U_ 3 dB = ^ rads”') and 
obtain the normalized high-pass Butterworth filter = 1 rad s~ '). 
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Figure 9 28 Low pass to high pass iransrormation of an active RC second- 
order Buttenvorih filter (units in fi, H, F) 


Low-pass to band-pass iransfortnaiton 

We have no general rule here, however, earlier m this section we designed a 
band-pass filler using the state variable method This is a very practical band- 
pass synthesis method and does not require starling with a low-pass prototype 
Higher-order band-pass filters only require cascading several second-order 
sections which are individually designed by using each of the quadratic factors 
of the higher-order function These can be obtained from a low-pass function 
by making the substitution p =« QA, where Q is the band-pass filler Q desired 
Then substitute A = p -i- I/p to convert the function to band-pass, factor into 
quadratic sections, realize each one with the state variable network, and connect 
them in cascade 

Example 9 15 Transform the normalized second-order Butterworth low- 
pass function to a g = 10 band-pass function (coo = 1 rad s"*), and factor U 
into two band-pass quadratics 

Given 


substitute 




1 

p^ + p^2 + I 


to yield 


p = QA = lOA 


r(A)r= 


lOOA* -f I0AV2 + I 


Let 


^=p+ lip 
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then 

1 / 

[p^ + 0.1414p3 + 2Mp^ + 0.1414p + 1 

factoring gives 

"" Too (p2 + 0.0682p + 0.9317)(p2 + 0.0732p + 1.0733 

9.3 TIME DOMAIN CONSIDERATIONS 

To this point we have discussed only the steady state aspects of signal processing. 
In considering the time domain response, the most common test signal is the 
step function. The response to a step input is illustrated in Figure 9.29 shown 
for a final value of unity. 

One of the features mentioned earlier was the difference in time domain 
performance of the Thomson as compared to the other filter functions. It has 
very little overshoot or oscillation to a step input. For those applications where 
this is of importance, it is the function to choose. This does not come free, how- 
ever; the Thomson has a much less selective filtering characteristic than the 
others. The transient performance of various third-order filters to a step input 
is compared in Table 9.7. Note particularly the excellent settling time of the 
Thomson. All of the functions have been scaled to a — 3 dB frequency of 
1 rad s~ ^ This is essential if a fair comparison is to be made. 

Scaling in the time domain is very similar to that in the frequency domain, 
but is inverted. Ifwewereto increase oj_ 3^3 from 1 to 10® rad s~Sthen all times 
in Table 9.7 would be divided by 10®; that is, they would be in microseconds. 

The networks of Table 9.7 are shown in Figure 9.30; all are scaled to 
®- 3 dB = 1 rad s“ ^ ! All values are in Q, H, F. 

Many useful design texts have been published on active filter design. A 
selection is Budak (1974), Daryanani (1976), Hilburn and Johnson (1973) 


Overshoot 



Figure 9.29 Step response definitions 
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Table 9 7 Third order system step response for © 3^8=^ s ‘ (all times in seconds) 



Rise time 

Delay time 

Overshoot 

(%) 

Settling time 
(to within 1%) 

Butterworth 

229 

214 

8 15 

942 

Thomson 

218 

168 

0 75 

3 78 

Chebyshev (0 1 dB) 

2 39 

238 

1019 

12 50 

Chebyshev (0 5 dB) 

250 

255 

8 93 

13 54 

Inverse Chebyshev 
(©_ , = 2 rad s *) 

2 59 

188 

1096 

9 24 


Johnson and Hilburn (1975), and Huelsman (1977) Commercially available 
integrated circuits, backed by application notes, have greatly simplified active 
filter application 



Sutierwerth Chebyshev 0 1 flS 



Thomson Chebyshev 0 5 dB 


09687 



Inverse C)>eby^ev2z-3p 

Figure 9 30 Third order networks characterized in Table 9 7 for 
^ 3 dB = 1 rad s'* (units in n, H, F) 


9,4 COMPUTER-AIDED DESIGN 

The term ‘computer-aided design’ can be misleading and it is sometimes 
replaced with ‘computer-aided analysis’ In general, the available computer 
programs are analysis programs, but of course accurate and fast analysis does 
aid the designer 
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One of the most useful and readily available programs is SPICE developed 
by Drs B. Cohen and D. O. Pederson of the University of Cahfomia at Berkeley 
(Cohen and Pederson, 1976). This circuit simulation program works from a 
nodal description of element locations, and for the purpose of analysing the 
kinds of networks we have been concerned with includes voltage-controlled 
voltage sources (as well as others). The output can be a magnitude, or phase, or 
transient response to a pulse input, either as a print-out or plot. The program is 
limited only by the computer it is used with. Non-linear d.c. analysis is also 
included, as is noise analysis and distortion analysis. 

The program is very useful for determining changes in performance of a 
circuit as a function of element value change either due to tolerance, or change 
with temperature. A temperature coefficient can be included with each element 
specified and the analysis performed at various specified temperatures. 

The program also includes modelling capability for four types of semi- 
conductor devices: diodes, bipolar transistors, field effect transistors, and 
MOSFETs. Other versions are available ; I-SPICE (for interactive SPICE) and 
T-SPICE (for thermal SPICE). T-SPICE allows a complete thermal analysis, 
given the chip thermal characteristics and the complete circuit. 

Another useful program for signal processing applications is GOSPEL 
developed by Dr Lawrence P. Huelsman at The University of Arizona 
(Huelsman, 1968). This is an optimization program which can be used to solve 
sets of simultaneous non-linear equations such as are encountered in trying to 
determine circuit element values to realize a particular function. This program 
was used by Aikens and Kerwin (1972) to determine the element values for the 
single amplifier active RC structures given in Section 9.2.3. 
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Chapter 



A. G. BOLTON 


Filtering and Processing of Digital 
Signals 


Editorial introduction 

Signals in the processing chain of a measurement system may require that their frequency 
component characteristics be modified. The previous chapter described how this can be 
done using analog electronic circuitry operating with analog signals. Despite the great 
internal circuitry and system complexity of digital data processing systems they are often 
capable of carrying out processing operations on a signal at lower cost than their analog 
counterpart. Analog signals are first converted into the digital signal domain; in many 
cases the signal to be processed already exists in digital form. 

Digital filtering is not a recent concept but it did emerge after the analog methods 
arriving predominantly as a basic concept in the application of the early digital computing 
machines to such fields of use as geophysics (see e.g., Robinson and Treital, 1964). It finds 
increasing use as digital computing hardware costs fall and because many signals needing 
filtering now appear in the digital format. This form is also often chosen because many 
more systems designers today are more familiar with digital hardware than with the linear 
counterpart, analog system. 

With analog filtering methods there has emerged a well-defined subset of knowledge 
directed very specifically toward what have become known as filters. The term digital 
filtering is largely synonomous with the description digital signal processing. For this 
reason digital filtering information is contained within general digital signal processing 
texts or, if presented under the specific title, will usually include a wide range of basic 
signal processing knowledge. Suitable introductory texts are Bogner and Constantinides 
(1975) and Hamming (1977). Here digital filtering is explained in terms of implementation 
of systems analogous to the analog filters discussed in Chapter 9. 
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10 I IMPLEMENTING ANALOG TECHNIQUES IN DIGITAL 
FORMATS 


10 1 1 Introduction 

Dictlal techniques are \ery flexible because of programming and memory 
facilities The digital components and equipment available provide a variety of 
new techniques for measurement systems This chapter introduces some pro- 
perties of digital systems by outlining how analog techniques can be implement- 
ed digitally Also prov ided isa brief introduction to Z domain analysis Chapter 5 
presents an alternative approach to digital signal processing 
The performance of a digitally implemented filler can often resemble that 
of an analog equivalent so closely that for all practical purposes they are 
identical This provides a useful basis for introducing digital filters Special 
features can be introduced gradually 

10 1.2 Review of Analog Filters 

The previous chapter discusses the range of analog filters available and their 
implementation in some detail A summary is given here so the relevant aspects 
can be identified for application to digital fillers 
A filler IS usually designed m low-pass form and transformed to high pass, 
band pass or band-stop as required The low-pass filter is selected to fulfil the 
frequency discrimination or step response requirements often both aspects 
must be considered together This is because a filter which removes unwanted 
high frequency components can distort the desired signal considerably within 
the passband Graphs can be used to select fillers or approximate algebraic 
relationships applied to realize the component values 
Chapter 9 includes diagrams of overshoot :n the step response and ripple 
in the frequency response Figure 10 1 shows the overshoot and ripple properties 
for various second-order filters Higher order filters improve the frequency 
discrimination with a liule increase m the overshoot These are implemented 
in both active analog and di^laJ Jillcrs by sddmg ciLscaded second-order 
stages Stages w iih the highest damping factors should be earliest in the sequence, 
otherwise the signal levels within the filler can be quite large 
Table 10 1 gives values for the pole locations for several types of low-pass 
filters The Bessel and Chebjshev filters are normalized so that the 3 dB point 
IS at unity frequency Various normahzmg relationships are m use and should 
be checked before using any tables Note that the ripple in the even-order 
Chebyshev filters is above the steady stale gam whilst it is below for fillers of 
odd order 

Using the filter design tables the analog filter can be implemented using pure 
integration A useful arrangement is given in Figure 102 It has a pass-band 



ni.inUNC ANH PKOa.SSlNG or digitai. signai.s 


41? 


(o) 




Domping ongU? (deg) 

I'icurc lO.I (;i) Polo localions and (b) corrcspondinc o\cr>.lioot 

and ripplo 


Cain of unity and the signal levels at the outputs of the integrators are nearly 
eijual. Also the design relationships are very convenient to implement. 

Digital integration appro.\imatc.s analog integration for large sampling rates. 
In its simplest form it is implemented using the addition operation. This allows 
a useful range of digital filters to be implemented quite conveniently. Peculiarities 
of digital filters will be examined after basic design techniques have been des- 
cribed. 

10. 1.? Uasic Design Techniques 

In l igure 10.? the analog signal to be filtered is cotuerted to digital form using 
an analog-to-tiigital (A D) converter, processed u.sing a computing element 
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Table 10 1 Some selected pole locations for low-pass filters 


Imb) 



Type 

Order 

(0 

P 


b 

Bessel 

1 

10000 


10000 



2 

12720 

08660 

1 1016 

06360 


3 

1 3227 


1 3227 




14476 

07235 

10474 

09993 


4 

14302 

09580 

I 3701 

04102 



16034 

06207 

09952 

1 2571 

Butlenvorth 

1 

10000 


10000 



2 

10000 

0 7071 

0 7071 

0 7071 


3 

10000 


10000 




1 0000 

0 5000 

0 5000 

0 8660 


4 

10000 

09239 

0 9239 

03827 



10000 

03827 

03827 

09239 

3 dB Chebyshev 

1 

10024 


10024 



2 

08414 

03882 

03224 

07772 


3 

02986 


02986 




09161 

01630 

01493 

09038 


4 

04427 

0 4645 

02056 

03920 



09503 

00896 

00852 

09465 



Figure 102 Analog filter implementation using inte- 
gration 

+ 2p(os + 0 )^ 


T(5) = 
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Figure 10.3 One basic digital filtering arrangement 


such as a microprocessor, and again converted to analog form. Various other 
sources of digital signal data, including a transmission system, data storage, or 
a previous digital filter, could arise. Similarly the output could be used in a 
digital form in a variety of ways. The simple analog input and output system 
serves here as a useful introduction. 

The analog quantity is represented within the digital system in binary weighted 
form. It is usually most convenient to represent negative quantities using two’s 
complement notation (Hill and Peterson, 1974). This allows the same arithmetic 
program instructions to be used with both positive and negative quantities. 
Replication of the analog system by the digital system relies on various assump- 
tions. These assumptions will be examined in detail after the basic design 
procedure has been outlined. They are, with respect to time, that 

(a) the sampling rate is larger than any frequency component present at the 
analog input; 

(b) the sampling interval is smaller than any time constant within the filter; 
and with respect to magnitudes, that: 

(c) the binary data represent the analog signal with an arbitrary degree of 
precision; 

(d) the analog quantity is never too large to be represented within the digital 
system; 

and with respect to coefficients that : 

(e) the coefficient values can be implemented precisely. 

Digital integration can be performed by adding successive input values. The 
program flowchart of Figure 10.4 will provide digital integration which nearly 
approximates analog integration given the previous assumptions. The sampling 
interval T determines the constant of integration. The output from the digital 
integrator will be l/T times greater than that from an analog integrator. In a 
microprocessor the time delay T can be obtained using a fixed number of 
instructions with a known execution time or an external clock which periodi- 
cally interrupts another program to perform the filter algorithm. The filter 
time constants are proportional to the sampling interval T. It is, therefore, 
important to fix its value. This gives an advantage to digital filters. The interval 
can be varied to tune filters of arbitrary complexity with precision. In the digital 
filter design it is convenient to normalize frequencies and time constants to the 
sampling interval. 
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e{/)“ some coses 

Figure 104 Program flowchart for 
digital integration 


The useful first order filter given m Figure 10 5 uses the integration algorithm 
It approximates an analog filter having the transfer function 

T(s) = — ^ where t = 1/a, 

TS + 1 

The subscript a stands for analog In the corresponding digital filter the co- 
efficient a must be adjusted to allow for the digital integration because it has 
an output 1/r times greater than that of the analog integrator It is convenient 
to normalize the time constant and cut off frequency of the filter to the sampling 
interval 


Tn = t/T and <o„ = (oT 
The coefficient of the digital filter is then a = l/t„ 

Example 10 1 A first-order, low-pass digital filter with a cut-off frequency 
of 100 Hz (co<. = 628 rad s' *, r = 16 ms) is to be implemented 
The steps necessary to design such a filter are 

(a) Choose the sampling interval Use about to times the time constant 


T = 0 1 ms 
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Figure 10.5 Block diagram arid flowchart for a first- 
order, low-pass digital filter 


(b) Calculate the normalized time constant; 

t„ = t/T=16 

(c) Evaluate the coefficient 

a = 1/T„ = 2-^ 

(d) Use the flowchart of Figure 10.5 with a = 2“^ and T = 0.1 ms to imple- 
ment a digital processor. 

Extension of this design procedure to second 

diagram and corresponding program flowchart of Figures . 

this case it is necessary to use tables to find the pole locations. Again cut-o 

frequency can be normalized to the sampling interval as shown in the following 

example. 












418 


HANDBOOK OF MEASUREMENT SCIENCE 



Figure 107 Program flowchart for a 
second-order digital resonator 
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Example 10.2. A second-order, 3 dB, Chebyshev low-pass filter with a 
cut-off frequency of 40 Hz {o3^ = 251 rad s“ ‘) is required. 

(a) Choosing a sampling interval less than one tenth the time constant; 

T = 4 ms 

(b) Calculate the cut-off frequency normalized to the sampling interval; 

tOcn ~ (OcT = 0.1004 

(c) Select the required pole location using the tables for analog filters; 

m = 0.8414 )5 = 0.3882 

(d) Evaluate the filter coefficients; 

G>f = = 0.1004 X 0.8414 = 0.08448 

P{ Stables 

Note that the damping factor is not altered by the scaling for frequency. 

(e) Implement the filter using the program flowchart of Figure 10.7. 

The step response for this filter is given in Figure 10.8. 

10.1.4 Review of Design Assumptions 

It is necessary to examine the assumptions which made the digital filter closely 
resemble the analog equivalent. This allows the selection of important design 
aspects such as register lengths, sampling rates, and any necessary preparation 
of the analog signal before conversion to the digital form. 



— I 1 1 1- 

50 100 150 200 

Sample 

Figure 10.8 Step response of filter design for Example 10.2 
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Fieure 10 9 Anti aliasmg (low. pass) filter before A/D con^ ersion 


The first assumption was that the sampling rate was larger than any 
frequency component present at the input lo the A/D comerter If they were 
present aliasing would occur This means the high frequency components 
appear as low frequenc) components wnlhin the passband after sampling. Time 
and frequency domam representations of aliasing are gisen in Chapter 5 

The frequency domain mterpretaiion is based on the fact that the product 
of two smusoids p\es both the sum and the difference frequency components 
Saraphng at mten'als of T can be regarded as muhiphcation by all sinusoids 
of the form cos(2*tj/0 where n as 0, I, x In this way the frequency com- 
ponents abo\e half the samplmg frequency (the Nyquist frequency) are folded 
back as unwanted lower frequency components These high-frequency com- 
ponents must be removed using anti aliasing low-pass filters before the A/D 
conversion as shown in Figure 109 The frequency domam representation of 
ahasmg gives an estimate of the amount of filtermg required, the frequency 
components lematnmg above the Nyquist frequency will appear within the 
passband with the same magnitude 

It IS ofmterest that use can be made of this scheme to provide selective band- 
pass filters. The A/D converter can act as a frequency translator, the anti- 
ahasmg filter becoming a band-pass filter Quadrature components are needed 
to complete the frequency translation 

A second assumption was that the samplmg mterval is smaller than any time 
constant m the filter Longer samplmg mtervals mean that the digital mtegration 
no longer approximates the analog mtegration and the filter’s output will differ 
from the designed response Generally, sampling rates ten tunes any time 
constant are adequate to give an accurate approximation this can be improved 
using compensation. Relationships to mamtam the magnitude frequency re- 
sponse m the output are (Bolton, 1981) 

o, = (1 + o)cos(b) — 1 6, = (1 + a)sm(h) 

then 

“ = VCoJ + bjUaj +bj + 2a,+ 1)] 

J- 6|) + 1] 

For lower samplmg rates the bilinear transform, described m the following 
section, provides a more useful filter 
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’h’ 

Output 


Figure 10.10 Signal points where underflow and overflow can occur 

A third assumption was that the analog data were represented with an 
arbitrary degree of precision. Finite digital register lengths mean there must be 
some magnitude quantization. This aspect is also discussed in Chapter 12. 

The input and output, A/D, conversion must have a sufficiently small quantiza- 
tion level to represent the signal with the desired accuracy. 8-bit conversions 
will give a level of of 0.4% of the full analog range. However, the signal 
levels within the filter are also quantized; usually these effects are more severe 
than those input and output conversions. Various signals are labelled in 
Figure 10.10. 

The dominant effect of quantization within the digital filter occurs at point ‘e’ 
of Figure 10.10. The difference signal, r — e is multiplied by the coefficient, a, 
to give a component at ‘e’. The coefficient is much less than unity, particularly 
for large sampling rates. Underflow at ‘e’ will occur when the original difference 
r - e is l/fl times larger than the quantization level. This can cause a deadband 
in the output which is 1/a times the quantization level. An original quantization 
level of 0.4% can be expected to give a deadband of 4% even if the sampling 
rate is only ten times the time constant of the filter. This must be avoided. 

One solution is to use longer registers within the filter, leaving the input and 
output data conversions unchanged. In practice use of 16-bit registers and time 
constants about ten times greater than the sampling interval provides workable 
operating levels. Larger sampling rates make the filter more ideal and reduce the 
aliasing filter requirements, however, double precision arithmetic may be 
needed to reduce the deadband effect. An alternative scheme is to cascade 
digital filters, the first having a faster sampling rate and smaller time constants. 
In effect this is a digital aliasing filter. Another approach is to use an instruction 
to detect whenever the signal at ‘b’ is greater than zero. If in these cases a least 
significant bit is added at ‘g’ the deadband will be removed and there will be 
only a minor distortion of the filter characteristic. This also removes all small- 
signal limit cycles (Bolton, 1981). 

A fourth assumption was that magnitude overload does not occur. With 
digital systems magnitude overload causes a discontinuity in the output to the 
other extreme value. A slight overload in the positive direction will cause a 
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large negative value This contrasts with analog systems which usually saturate 
This IS further described under explanation of glitches m Chapter 12 
The effect on the output of the Wter is much greater in digital systems where 
overloads can cause sustained oscillations These can be avoided by providing 
a saturating charactenstic at pomts‘d’, ‘f’.and ‘h’ of Figure 10 10 usingprogram- 
ming techniques Alternatively the signal levels and input conditions can be 
confined so that overloads can never occur With the Butterworth, Thomson, 
and Bessel filters the signal levels within the filter should never exceed twice the 
maximum input signal level In higher-order filters the stages must be m order 
of ascendmg Q, with the signal passing through the more damped stages 
first The 3 dB Chebyshev filters require an additional factor of 2 
Finally, it was assumed that the coefficients could be implemented precisely 
With the type of digital filler described here the response never cntically 
depends on the values of the coefficients (Bolton and Davis, 1981, Bolton, 
1981) Having rounded the value of a coefficient for implementation it may 
be helpful to assess its affect on the filter’s response by evaluating the pole 
locations for these new values With some types of digital filters the response 
depends cntically on proportionate changes in the coefficients to the point 
where the available transfer functions are severely hmited However, with the 
filters descnbed so far the ability to realize the coefficients with precision provides 
stability advantages over corresponding analog types 

IQ.l^ Extension of Low-pass Designs 

Extension of the low-pass design to give a high-pass response uses the transform 
descnbed in Chapter 9, that is,s -► 1/s, which means poles at r /+6 are relocated 
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at l/r /±0. The output from the digital filter is then taken from the point shown 
in Figure 10.6. 

The transform to give a band-pass output is s ^ (s^ + cOcentre)A- The transfer 
function at the bandpass output of Figure 10.6 approximates 

rr,, ^ _ (OS 

' s^ + 2p(os + 

and this can be used to implement band-pass designs. 

The notch output is quite deep at high sampling rates. At low sampling 
rates the special notch structure of Figure 10.11 is required to provide a deep 
notch (Bolton, 1981). Three delay terms are shown. Each is obtained by using 
the value from the previous loop. This means the notch addition is performed 
after the high-pass output is computed but before the low-pass output. 


10.2 ANALYTIC TECHNIQUES FOR SAMPLED SYSTEMS 


10.2.1 Introductory Remarks 

At low sampling rates the comparison between analog and digital filters becomes 
inadequate for analysis or design. The transform which represents digital 
systems without approximation is the Z transform, in the same way as the 
Laplace operator s is used with continuous systems. The Z transform is now 
introduced and a filter design example given to serve as an introduction to 
specialist texts. 

10.2.2 Sampling 

When an A/D converter samples a value of a signal subsequent changes in the 
signal are not registered by the computer until the next sample is taken. It is as if 
the value of the signal is held constant during the time interval. However, 
holding the value of the sampled function constant is mathematically more 
complicated than representing the sampled output by a series of delta (5) 
functions. The delta or impulse function, 5(t — T), is a function of time which is 
zero everywhere except at time f = T. At t = T its value goes to infinity for an 
infinitesimally small duration. The area of a unit impulse is one unit. The 
function 2b{t — T) has an area, or integral, of two units, and so on. A sampling 
stage is described mathematically as 

rit) = f 5(t - nT)m 

n — Q 

Sampling is represented in Figure 10.12. 
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Figure 10 12 Sampling of an analog 
signal at intenals of T 


Convolution 

Sampled systems can be used to show the significance of convolution when 
determining the output of linear systems to known input functions Consider a 
sj-stem which has the response to a unit impulse given in Figure 10 13 (dotted 
curve) The output of this system decays by 50% at each sampling interval, 
which corresponds to a time constant t = T/ln 2 The impulse response ^*(0 
IS a senes of functions 

g*(t) = 6(1) + 0 55(1 - T) + 0256(t - 27) + 

If the input function to this system is a sampled ramp the output at any time 
IS the sum of the components corresponding to each of the input samples. 
This IS because the system is linear for which the principle of superposition 
applies 

When evaluating the output of^*(r) to r*(t) at, say, r = 4Tit is necessary to 
sum the four components present These arc 4, 1 5, 0 5, and 0 125, w hich add to 
6 125 This summation can be represented as, 

C*(4T) - i r*(nDg*((4 - n)T) 

■*t 
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--iT) 

0 T 2T 3T 

Time 

Figure 10.13 Impulse response of a system, 

where the function has been turned about f = 4T. The output at any time 
t = mT can be found using the more general convolution summation 

Wl 

c*{mT) = ^ r*(nT)g*{(m — ri)T) 

n = 0 

The convolution summation lends itself to a numerical evaluation using a 
digital computer. This is particularly useful when it is not convenient to describe 
the functions in analytic form. 

10.2.4 The Z-transform 

All the functions used so far have been functions of time. The use of the Laplace 
and the Z-transforms involve functions of other variables which transform to 
functions of time. The transforms are devised so that the convolution operation 
in the time domain corresponds to multiplication in the Laplace or z domains. 

The Z transform is defined so that a sampled function of time f*(t) is trans- 
formed to a polynomial function of z~\ F(z). The power of is given by the 
time of the impulse and the coefficient of the term is given by its magnitude. In 
general if 

f%t)= Y^aM-nT) 

n = 0 


^(z) = f a„z " 
11 = 0 



then 
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The example previously evaluated in the time domain can be used to illustrate 
the use of the z domain The ramp input becomes , 

R(z) = Oz® + l 2 ~* + 22“^ + 32“^ + 42“* + 

and 

G( 2 )= l 2 ® +052-* +025z"^ +01252-3 + 

The output function m the z domain, C(z) is given by the product of these two 
functions It can be seen that the coeffiaent of z~* m this product is 

term in z * = I 2 * x 0 \252~^ + 2z~^ x 025z ^ 

+ 32 3 X 052“' + 42“* X 2° 

= 6 1252 “* 

The convolution summation m the time domain has been obtained using the 
product m the z domain All of the output impulses are represented by the 
function C(z), the output at nTbeing given by the coefficient of z"" 

Most useful functions can be expressed as the ratio of finite polynomials 
in the 2 domain This allows simple and convenient multiplication operations 
in the 2 domain Remembering that 

» 1 + X + + x** + 

1 — X 


I 2 


1 - 1/(22) 2 - } 

Transform tables are useful to find expressions such as those for Rfz) 


when the sampling interval T = I unit 

Fwv/iV/sta ’.w X di-sm-asw VTaTis^wiTTitd wA'o \Vrt ViTnt dOTrawi 

obtaining the z domain function as a sum of terms m 2 “" Usually the function 
m the z domain contains a polynomial m the denominator so synthetic division 
is used to obtain the series of terms An alternative technique for obtaining the 
inverse 2 transform is the method of residues This is very similar to the cor- 
responding Laplace method If o/ot is the residue at a singular complex pole at 
Zpl§, the component from this pole at f := nTis 

2al2pr"’ cos[(n - 1)^ + a] 

It can be seen that if | Zp | > 1 and the pole hes outside the unit circle, the system 
IS unstable The unit circle gives the boundary of stability and corresponds to 
the jg; axis of the s plane 
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The z domain can be related to the Laplace domain using the fact that the 
operator z~^ in the z domain corresponds to the operator e“®’" in the Laplace 
domain. Often the equality z = e^^ is inferred to relate the Laplace and the z 
domains. In fact the equality is not strictly correct since, for example, e’^^^ 
gives a delay of T/2 whilst is not defined. The relationship between the 
Laplace and z domains can be inferred by comparing the responses in the time 
domain for given pole locations. When the impulse response from poles in the 
Laplace domain at a + jb is sampled, the poles in the z domain for the cor- 
responding response are at e‘ ‘/±b . 

In this sense z = e'T This is called impulse invariance and can be used as a 
design basis for filters. It should be remembered that the impulses do in fact 
differ because one is sampled, so the step, frequency, and phase responses will 
differ also. Figure 10.14 shows some examples of corresponding impulse in- 
variant responses in the Laplace and Z domains. 



Figure 10.14 Corresponding impulse invariant responses 
in the z- and s-planes 
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103 FILTER DESIGN USING THE BILINEAR TRANSFORM 

If a transfer function T(s) is given in the Laplace domain it can be implemented 
in the z domain using the frequency narped bilinear transform This is 

, z — I 
S = Q) cot(co/2) ^ ^ ^ 

where to is normalized to the sampling interval The implementation is ap 
proximate, however when the warping term is included the sinusoidal 
magnitude and phase response at to is identical to the ongmal Laplace domain 
function In filter design to is set equal to the cut-frequency so the salient 
properties of the filter are preserved 

Example 103 A second order Buttcrworth digital filter is to be implemen- 
ted having a cut-off frequency at 1 kHz (628 krad s" ') using a sampling fre- 
quency of 10 kHz and the delay implementation 
To design such a filter we use the following steps 

(a) From filter tables the poles are at l /± 135° which gives a transfer function 

“ (s + 1/V2)" + i 

This provides a gam of unity for arbitrarily small frequencies 

(b) The normalized frequency 

m = 01,7= 628 x 10’ x 10 * = 0628 

(c) The frequency transformation required for the analog design can be 
combined with the frequency warping relationship using the substitution 

s = tu^cot(ai/2)^^ = 0628 x J 934 = 1 21 

z + I z 4- 1 z -f- I 

(d) Find T(z) from 

Onyr^ +0274Z+ 1 

z" +0333Z + 0262 

This implementation is given in block diagram form m Figure 10 15 It is the 
preferred implementation for very low sampling rate systems because it provides 
zeros in the stop band which give better attenuation in this region However, for 
large sampling rates its coefficient values become critical and the signal magni- 
tudes vary considerably throughout the filter Also, at large sampling rates the 
provision of zeros at 2 = —1 is not necessary and they require additional 
computation 
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Figure 10.15 Block diagram of a delay implementation 
digital filter 


Analog filters with carefully placed zeros in the stop band can be implemented 
digitally using either design technique and the corresponding advantages 
realized. 


10.4 DISCUSSION 

The introduction to digital filtering techniques presented here concentrates on 
the implementation of approximations to analog filters. Whilst this does provide 
a useful introduction to some of the properties of digital filters and to some 
useful filters, digital systems provide many unique features. These can be useful 
in special situations; and measurement systems are a collection of special 
situations. Although the techniques outlined here may find direct application 
it is more likely this treatment will be used as an introduction to more compre- 
hensive treatments of the subject. 
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Chapter 



D. M. MUNROE 


Signal-to-Noise Ratio Improvement 


Editorial introduction 

A measurement system begins with sensing stages that couple to relevant measurands of 
the system under study. The power level of the information-bearing signals formed by the 
sensors is often very low and may be swamped by the unwanted noise signals that are 
present. Careful attention to sensor and circuit design and assembly, plus use of certain 
signal processing methods, makes it possible to greatly enhance the original signal-to-noise 
ratio to usable levels. 

This chapter discusses the various strategies that are available providing a basis for 
their use (the best information is available in the literature of companies marketing such 
products). It is surprising to find that, after well over half a century of analog signal process- 
ing progress, there still exists no full length text that deals generally with signal recovery 
and enhancement in general instrumentation applications. This chapter is unique in this 
respect, apparently being the first to appear in a published text. It deals with this material 
at an extended theoretical depth. 

It will be noticed that techniques, originally conceived for analog signals, are gradually 
being implemented in digital form, thus in many cases providing improved performance 
at comparable or less cost. This trend can be expected to increase with time as digitally 
oriented designers, seeking improved means of signal recovery, become more familiar 
with the principles already proved in the analog signal domain. 

11.1 INTRODUCTION 

Recovering or enhancing a signal or improving a signal-to-noise ratio simply 
means reducing the noise accompanying a signal. There are two basic ways of 
doing this: 

(a) Bandwidth reduction, where the noise is reduced by reducing the system 
noise bandwidth (BJ. This approach works well if the frequency spectra 
of the noise and signal do not overlap significantly, so that reducing the 
noise bandwidth does not affect the signal. With random white noise 
the output noise is proportional to y/B„. 
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(b) A\eragmg or integrating techniques, where successive samples of the 
signal are synchronized and added together The signal v-ill grow as the 
number («) of added samples, with random white noise the noise will 
grow as y/n 

In many applications there is significant overlap between the signal and noise 
spectra and improving a signal to-noise ratio must be done at the expense of 
the response time or measurement time (T), with random white noise inter- 
ference the output signal-to-noise ratio is proportional to y/T The bandwidth 
reduction technique is best looked at from a frequency-domain point of view, 
signal averaging and correlation techniques lend themselves to time-domain 
anal>sis 

In this chapter, mathematics and theoretical considerations will be reduced 
to a minimum, the reader is referred to Chapter 4 for additional theoretical 
and background information For further simplicity we will assume that all 
noise processes are stationary and that both signal and noise are crgodic, 
analog variables, we will not concern ourselves with digital signals or discrete- 
time (sampled) signals except where such signals arc involved m the enhance- 
ment techniques In addition, only signal recovery techniques will be 
considered Further processing, such as least-squares polynomial smoothing 
of a waveform or Founer transformation to obtain a frequency spectrum, will 
not be considered here 

We will start by reviewing some basic concepts, move on to discuss ways to 
avoid adding noise (e g hum pick-up and preamplifier noise) and then discuss 
mstrumentational techniques to reduce the remaining noise content Finally, 
we will discuss some of the special considerations involved in recovering pulse 
Signals from photon (light), ion, or electron beams 


11.2 NOISE AND NOISE BANDWIDTH 

Noise IS an undesired signal It usually becomes of interest when it obscures 
a desired signal Figure 11 I shows the power spectral density (powei;/unit 
bandwidth) of the most commonly encountered types of noise 
Deterministic noise can range from simple discrete frequency components 
such as power-line hum at harmonics of 50 or 60 Hz, to wide-band inter- 
ference (RFI) caused by narrow, high-energy pulses from power-line switching 
spikes, pulsed lasers, radar transmitters, and the like 
Stochastic (random) noise is found in most systems both as white noise, 
where the power spectral density is independent of frequency, and also as l/f 
or flicker noise, where the power spearal density decreases as frequency 
increases Power spectral density is usually measured m mean-squared-voIts/Hz 
or mean-squared-amperes/Hz, for noise, such specifications are usually 
referred to as spot noise data and usually are a function of frequency Notice 
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Figure 11.1 Environmental noise (reproduced by permission of EG&G Princeton 
Applied Research Corporation) 

that for an r.m.s. voltage of e (volts) and a frequency range of Af (Hz), the power 
spectral density, S, is given by 


“S-fe)' 


( 11 . 1 ) 


The quantity efJ{Af) is usually referred to as voltage spectral density and 
is measured in r.m.s.-volts/^Hz (volts per root hertz). Similarly, we can refer 
to current spectral density specifications in units of r.m.s.-amperes/^Hz. 

White noise is usually found in one of two forms: Johnson noise and shot 
noise. Johnson, or thermal, noise is caused by random motion of thermally 
agitated electrons in resistive materials, and the mean-square noise voltage is 
given by 

el = 4kTRAf (11.2) 

where k is Boltzmann’s constant (1.381 x 10"^^ JK"*), T is the absolute 
temperature (kelvin) and R is the resistance (ohm). Alternatively, from Ohm’s 
law, the mean-square noise current is given by 


AkTAf 

R 


(1T3) 
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Shot noise is caused by the random arrival of electrons (see Section 1 1 10 2) 
at. for example, the electrodes of electron tubes or transistor junctions A d c 
current, I, will have a noise-current component, j„, given by 

il=2AeW (lUa) 

where e is the charge of one electron (~ 1 6 x 10“ C), A is the mean gam 

experienced by each electron and / is in amperes In many cases (see Section 
11 102), /I = 1, so that 

tl = 2el&f (II 4b) 

Flicker noise has many different origins and is not clearly understood but 
exhibits a I//" power spectrum with n usually in the range 0 9 to 1 35 Note that 
d c drift IS a very-low -frequency form of flicker noise 

What do we mean by bandwidth'^ In the simple low-pass filter circuit shown 
in Figure 11 2a, for example, we usually and somewhat arbitrarily define the 
signal bandwidth (Figure 11 2b) to be the cut-off frequency, where eje, = 
707%(-3dB)or?J/c* = 50% (the half-power point) 

Notice that frequencies above /* will obviously pass (though attenuated) 
through the filter, and therefore are not really cut off For noise, it is convenient 



Figure 11^ Signaland noise bandwidths of low-pass filter (a) circuit, (b) Bode 
plot 
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to think in terms of an equivalent noise bandwidth, B„, defined by the relation- 
ship 

Bn = ^JV(jco)lM/ (11.5) 

where f/(j©) is the frequency response function of the system and G is a gain 
parameter suitably chosen to be a measure of the response of the system to 
some parameter of the signal: for low-pass systems (e.g. Figure 11.2b) G is 
usually taken to be the zero-frequency (d.c.) gain; for band-pass responses, 
G is usually made equal to the maximum gain. 

Using the above definition, and taking G to be the zero-frequency gain 
(i.e. unity), we can readily calculate that for the simple RC filter shown in 
Figure 11.2a: 


B„=l/4RC Hz (11.6) 

Noise, of the stochastic form, is reviewed in relation to instrument systems 
in Fellgett and Usher (1980). 


11.3 SIGNALS AND SIGNAL-TO-NOISE RATIO 

Suppose we were to look at a complex waveform on an oscilloscope. What is 
the signal? Is it the complete waveform? The peak (or r.m.s. or average) amp- 
litude? The depth of modulation? The implied frequency spectrum? The 
difference in time or amplitude between two features of the waveform? The 
answer, of course, is that the information-bearing signal could be any or none 
of the above. In this chapter, we will restrict ourselves to some commonly 
encountered types of signal where enhancement is often required. Together 
with the enhancement technique normally used, these are: 

(a) base-band (d.c.) signals: low-pass filtering or autocorrelation; 

(b) amplitude modulated signals: band-pass filtering or phase-sensitive 
detection; 

(c) repetitive (not necessarily periodic) swept signals: signal averagers; 

(d) photon, electron or ion beam signals; photon-counting systems. 

The word signal is often used rather ambiguously to mean either the total 
signal being measured or a noise-free, information-bearing component of it. 
The following definitions should allow us to avoid such confusion. We will 
normally talk in terms of a total signal consisting of an r.m.s. signal component 
(S) accompanied by an r.m.s. noise component (iV). Thus 


Signal-to-noise ratio, SNR = S/N 


(11.7) 
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Note that 

measurement uncertainty or inaccuracy = (1 1 8) 

Signal-lo-Noise improvement ratio (SNIR) = (^1 9) 

For unity gam (i e 5^ = S,), band-limited white input noise of bandwidth 
Bai and output noise bandwidth 

SNIR = NJN^ = y/(BJB„,) (11 10) 


IM NOISE MATCHING AND PREAMPLIFIER SELECTION 


All preamplifiers add noise Whether this additional noise is significant will 
depend, of course, upon the noise level from the signal source Since uncor- 
related noise adds vectorially (in an r m s fashion), the preamplifier noise can 
be neglected if it is less than about one-ihird of the source noise 

V[(10)' + (03)*]= 10 

We can think of a practical preamplifier as consisting of an ideal noise free 
amplifier with a (frequency-dependent) noise-voltage generator of voltage 
spectral density (V/^Hz), and a noise current generator of current spectral 
density (A/^^Hz), connected to its input as shown in Figure 11 3a Figures 
1 1 3b and 1 1 3c, respectively, show separately the gain seen by the amplifier 
interna! noise voltage and current generators Any mput shunt capacitance 
(see Figure 11 3b) will decrease the input impedance and cause output noise 
that mcreases with frequency if Zf is resistive 
The preamphfier noise may also be defined (Faulkner, 1966) in terms of an 
equivalent senes noise resistance R,, and an equivalent parallel noise resistance, 
R , , where (from equation (112)) 




and (from equation (1 1 3)) 


R, = — 5 — ohms 


We can also define the noise figure (NF) of the preamplifier to be (in dB) 
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Zf 



(c) 

Figure 11 3 Amplifier noise: (a) equivalent circuit, (b) voltage noise, (c) current noise 


A perfect or noiseless preamplifier would have a 0 dB noise figure. Figure 1 1.4 
shows the noise figure contours that result when the noise figure o a practica 
preamplifier is plotted as a function of source resistance and frequency, otice 
from equation (11.11), that with high source resistance, RJRs 

( Rs 
NF ci: lOlogiof 1 + ^ 

and the amplifier noise current, i„, predominates. With low source resi^ances, 
the amplifier noise voltage, e„, becomes the major noise soume. erever 
possible, preamplifiers should be chosen so that their 3 dB noise-figure con our 
encloses the expected range of source resistance and frequency. 

For a given preamplifier, the optimum source resistance, R ^ , is given by 

R,(opt) = ^ = ^{R.Rd ohms (11-12) 

Note that adding a series or parallel resistance between the signal source and 
the preamplifier always reduces signal and adds noise, and so cannot e use 
to obtain a better match. 
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Figure 1 1 4 Typical noise figure contours for a 
high input impedance preamplifier (reproduced by 
permission of EG&G Princeton Applied Research 
Corporatjon) 


Preamplifiers can be classified in many ways, one basic division, for example, 
IS between differential input and single^ended input All other things being 
equal, a differential preamplifier generates 3 dB (41 4%) more noise than a 
single-ended version However, this disadvantage is significant only m situations 
where preamplifier noise predominates and, in many cases, is outweighed by 
the flexibility of a differential input and its ability to remove ground-loop 
problems (see Section 115) 

Transformers are often used to match very low source impedances (0 1 fi- 
1 kfi) Figure 115 shows an ampbfier with an optimum, source 

resistance value of 1 MH being matched to a 100 fi thermopile by means of a 
100 1 voltage step-up transformer (10,0(X) I impedance transformation) 
Note that, in general, such noise matching does not result in the same circuit 
values as would power matching, that is, the ampbfier input resistance is not 
normally equal to ^/{RgR,) Transformers should be avoided if possible, since 
they reduce frequency response, may pick up magnetically induced inter- 
ference and may be microphonic 
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For sources of approximately 100 Q-10 kQ, preamplifiers are available 
which use an input stage consisting of multiple bipolar transistors connected 
in parallel to provide a lower value of yJ(Re R\)- Such preamplifiers avoid the 
bandwidth constraints imposed by input transformers. For higher impedance 
sources (1 kD-100 MQ), preamplifiers usually employ junction-FET’s as 
input devices and are available as voltage preamplifiers, charge amplifiers 
(for use with capacitive transducers), or current-input (transresistance) ampli- 
fiers. (See Figure 11.6). 






Figure 11.6 Amplifier configurations: (a) voltage, inverting; (b) voltage, non- 
inverting; (c) charge; (d) current input (trans-resistance) 
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In Figures 1 1 6a and 1 1 6b any cable capacitance or stray capacitance, 
C., will form a low pass filter with the source resistance, i?„ having a -3 dB 
frequency given by 


1 

“ 2nR,R,CJ(R, + RJ 


(Figure 1 1 6a) 


or 

-^‘=5^ (Figure 11 6b) 

In Figures 1 1 6c and 1 1 6d, such shunt capacitance appears at first sight to 
ha\e no effect since it is effectively shorl-circuited by the virtual-ground input 
However, as shown m Figure 11 3b, shunt capacitance will cause a detenora 
tion in the output SNR and also (by introducing an additional pole mto the 
loop gam) may cause ringing in the amplifier response, or even oscillation 
By careful design (which usually includes adding a capacitor across the feed- 
back resistor) these effects can be minimized and with high source impedance, 
commercial current and charge amplifiers usually provide significantly greater 
bandwidth than can corresponding voltage amplifiers 


11,5 INPUT CONNECTIONS, GROUNDING AND SHIELDING 

Ideally, all grounds should have a zero-impedance connection to each other 
and to wet earth, in practice they do not Due to voltage drops across their 
finite impedance to earth, capacitively or inductively coupled interference, 
and other reasons, each ground (ends to be at a different potential from other 
nearby grounds If two (or more) such adjacent grounds are connected together 
to form a ground loop (Figure 11 7a), then the potential difference between 
the grounds will cause a circulating current The potential difference between 
grounds (Ccm). is called the common-mode source smce it is common to both the 
signal (via loop 2) and ground (via loop 1) inputs of the preamplifier 
Figure 11 7b rearranges the mrcuit of Figure 11 7a and assumes the signal 
source, c,, to be zero Note that the low resistance of the coaxial cable shield 
(braid), is in parallel with the senes combination of the source resistance, 
Rj, the coaxial cable centre conductor resistance and the preamplifier 
mput impedance, Z,n Under nonnal circumstances (R^ -j- R„ + > Rcf 

and Z,o $> (Rj + R„), so that as shown m Figure 1 1 7c, the common-mode 
voltage dropped across R^g is also applied across the preamphfier mput ter- 
minals More generally, with e, = 0, the preamphfier mput voltage, e,a, is 
given by 


= Cj 


‘^cg + Kg + Rpg 


(1113) 
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/^GROUND 
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PREAMP . 
GROUND /77 



preamp" 


Rs 5 = SOURCE GROUND RESISTANCE 
Rpg = PREAMP GROUND RESISTANCE 
Rs = SOURCE RESISTANCE 
Rj.5 = CABLE RESISTANCE (SIGNAL) 
Rco = CABLE RESISTANCE (SHIELD) 
Z|N = PREAMP INPUT IMPEDANCE 
ecm= COMMON MODE SOURCE 
es = SIGNAL SOURCE 
e = PREAMP INPUT SIGNAL 

gem RcG 


FOR 

Zm ^Rs+f’es 

AND 

z,„» Rea 



Figure 11.7 Ground loops; (a) physical occurrence; (b) schenrat.e equivalent circuit; 

(c) reduced equivalent circuit 

From equation (11.13), this common-mode input to the preamplifier can 
be removed (i.e. = 0) by making: 

(a) e,„, = 0. This can be attempted by grounding the source 
to the same ground point, and shielding to remove 

ively coupled interference, but the procedure is rarely comp p„™nlifier 

(b) 0. The usual approach here is to bolt both source and P eamphfier 

chassis to a large metal plate. Unfortunately, it is air y anart on 

large potential differences between points a centimetre or 
a large metal plate, such as a mounting rack. j „ onnH 

(c) Rgg = 00 . Floating or disconnecting the source from groun 

(d) r=t*;\r.:fcniaya,sob=flo«^^ 

powered. Note that disconnecting the power-line grou mnsists 

strument can be extremely dangerous. In many instruinen ® p® . .^ 

of an internal 10 Q-l kQ resistor that can be switched into 
effectively float the amplifier input terminals. 
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FOR 

Z H* 

AND 

Z M» >'* Res* 
AND 

Zh» 2,Ht»Rc« 




Figure 118 DifTerencial preainpbfier used v.Tlh single ended source (a) 
physical occurrence (b) schematic equnalent circuit, (c) reduced equivalent 
circuit 


Figure 118 illustrates the use of a differential amplifier with an unbalanced 
(single-ended) source to eliminate or reduce ground loop problems As m 
Figure 117, the circuit simplification assumes that the input impedance of 
each side of the differential amplifier and Z,„b) is much larger than source 
or cable resistances At low frequencies this differential connection results m 
equal common mode voltages at the amplifier’s input terminals (A and B), 
and the amplifier’s ability to discriminate against common mode signals 
(le Its common mode rejecuon ratio, CMRR or CMR), will determine the 
effectiveness of this configuration m suppressing ground loop mterference 
At higher frequencies, the cable capacitances will act with the unequal re 
sistances m the A and B input circuits to form unequal low-pass filters, so that 
Ca will no longer be equal to and there will be a spurious differential (A-B) 
input to the preamplifier Though cable resistances and capacitances are shown 
for convenience as lumped parameters, it should be remembered that m fact 
they are distnbuted As shown in Figure 119, high frequency unbalance 
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SOURCE 



(bl Ic) 

Figure 11.9 Differential preamplifier used with balanced source: (a) physical occur- 
rence; (b) schematic equivalent circuit; (c) reduced equivalent circuit 


problems can be avoided by using a balanced source. Specific comment on 
connecting stages together is to be found in (Morrison, 1977). 

To end this section, here are a number of miscellaneous recommendations 
regarding good wiring and grounding practices. 

(a) Keep cable lengths short; for difierential connections, keep them equal 
and following the same route. 

(b) Interference can be coupled into the ground (shield) or outer conductor 
of a coaxial cable. Consider coiling the cable to form an RF choke to 
suppress high-frequency interference of this kind, use a transformer, 
or use a balun (which allows d.c. continuity). 

(c) Remember that a loop of wire acts as an antenna; reduce the area of such 
loops as much as possible. 

(d) Separate low-level signals/cables from noisy ones. Where such cables 
must cross, cross them at right angles and with maximum separation. 
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(e) For non-coaxial connections use shielded twisted wire-pairs 

(f) Consider placing low-noise instruments m a shielded (screened) room 
when they are used with high-energy RF sources, such as pulsed lasers 

(g) Keep analog and digital grounds separate 


11.6 BANDWIDTH REDUCTION OF BASEBAND (d.c.) SIGNALS 

The term d c signal is often used (and will be in this chapter) to mean a signal 
which has a frequency spectrum that includes zero frequency (d c ) Technically, 
of course, a dc voltage or current is unvarying and, therefore, cannot carry 
information (other than that it exists) Such signals are also termed baseband 
signals, particularly when they are to be used to modulate a carrier frequency 
The simplest way to improve the SNR for such signals is to use a low-pass 
filter to reduce the noise bandwidth to the point where any further reduction 
would also change the signal to an unacceptable extent 

Figure 11 10 shows a typical source and preamplifier system for such a 
pseudo-d c signal We will use this circuit to show how the output SNR may 
by estimated and also how the SNR may be improved by reducing the noise 
bandwidth 

In this example, it is assumed that the photomultiplier tube (PMT) anode 
current consists of both a 5 Hz signal component (i, — 1 nA t m s ) and a d c 
component (1^^ =5 nA), typically, such dc currents are due to stray light 
and dark/leakage currents The adjustable direct-current generator (/„) is 
used to null (zero offset) the dc component of the PMT current, that is, /„ 
IS made equal and opposite to ^ This kind of zero suppression is often called 
background subtraction Notice that /„ must be readjusted manually each time 
the background changes 
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The coaxial cable connecting the PMT and preamplifier has capacitance. 

Notice that the virtual-ground input of this preamplifier configuration offers 

the following advantages in addition to those discussed in Section 11.4: 

(a) With zero volts across it, the cable capacitance cannot be charged and the 
cable will, therefore, be less microphonic than otherwise. 

(b) Since the PMT anode voltage is clamped at zero volts, the anode-to-last- 
dynode voltage is also held constant regardless of ij (assuming that the 
dynode voltage remains fixed). Signal currents will, therefore, not change' 
the PMT gain. 

There are five, uncorrelated sources of noise in this circuit. These are; 

(a) The d.c. component of the PMT current is produced by integrating 
anode pulses each of charge Q = Ae where A is the mean PMT gain. The 
interval in time between successive pulses arriving at the anode is random 
and governed by Poisson statistics (see Section 11.10.2). Assuming no 
additional dynode noise in the PMT, then the r.m.s. value of the PMT 
shot noise current spectral density, i„i, is given by 

ini = ^/i2AeIa,J = 

For A = 10® (say), i? = 10”^ Q, e 1.6 x 10"^^ C and /^.c. = 5 nA, the 
resulting output noise voltage density, is given by 

Cni = Rim = 10’ X 10^ X V2e/dc 10^° x 4 x 10"^^ 

= 4 X 10"^ = 400/iV/7Hz 

(b) For purposes of this example, we can assume that the zero-suppress 
current, /„, is obtained from a transistor current source circuit so that it 
has a shot noise current spectral density, i„ 2 , given by 

= 7^ = V(2e/<,.,.)(= ini/10') = 4 X 10-'^A/7^ 

Note that though f^.c. = the shot noise component from the PMT is 
much larger than that from the transistor current source. The resulting 
output noise voltage spectral density, e„ 2 , is given by 

e „2 = Ri „2 10^ X 4 X 10“ = 4 X 10“’ = 400nV/7Hz 

(c) The feedback resistor, R, will generate (at T = 290 K) a Johnson noise 
output voltage density, e„ 2 , given by 

en3 = Ji^kTR) ~ 4 X 10“’ = 400nV/7Hz 

(d) At 5 Hz, a typical value for the spot noise voltage density of the amplifier’s 
internal noise voltage generator is 30 nV/^Hz. This amplifier voltage 
noise will experience unity gain (see Figure 11.3b) since Zi„ (the PMT 
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current source) is very high The output noise voltage density, ^04, due 
to this noise source is therefore given by 

e„4 = 30nV/^Hz 

(e) At 5 Hz, a typical value for the spot noise current density, ins , of the ampli- 
fier internal noise current generator is 5 fA/^Hz The resulting contribu- 
tion, Cns. to the amplifier output noise is given by 

= 10'’ X 5 X 10-^5 = 5 X 10“® = SOnV/^Hz 
The total output noise voltage spectral density, e„, is given by 
en = Jieni + ell + + eli + e^s) 

Since ell > ^n3» el^, and then 

e^ =s e^i 

and the system is said to be detector limtied or shot noise limited An 
electrometer, an instrument characterized by extremely low leakage 
currents, is often used as a low-noise amplifier m d c measurements of 
this kmd 


In Figure 1 1 10 the parallel resistor and capaator in the feedback loop cause 
the curuit to act as low-pass filter of time constant RC seconds, so that the 
—3 dB cutoff frequency is given by l/2nRC and = l/4iJC(see Section 11 2) 

IT no discrete capacitor is connected across R, the typical stray capacitance 
will (say) be about C = 15 pF, so that RC = 10’ x 15 x 10”“ = 25 /is 
and = 10* Hz. The output noise voltage {£„) will, therefore, be 
En = e,y/B„ = 4 X 10-* X ^10* = 40mV 
The output signal 

e, = j,i? = 10”’ x 10’ V = 10 mV 

Therefore, 


The capacitance can be increased to 2 5 nF by addmg discrete capacitors so 
that the noise bandwidth becomes — 10 Hz. The — 3 dB comer frequency 
will now be at 6 4 Hz (1 e 10 x 2/n) so that the signal (frequency is 5 Hz) is not 
significantly attenuated The output noise voltage (£J is now reduced to 

= 4 X 10"* X 7(10) ~ 1 26 mV 


SNR=^: 


10 8 
126“ 1 


and 
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(b) 



Figure 11.11 Low-pass filter characteristics: (a) filter types (all have same ENBW); 
(b) frequency responses; (c) time responses to voltage step input 


so that (see equations (11.9) and (11.10)) 


SNIR = 


SJN, 

S,/N, 



/lOOOO 

~ 32 

/ 10 

~T 


The roll-off rate of a low-pass filter may be increased by adding more RC 
sections (see Figure 11.11a). Care should be taken in using some multipole 
filter configurations (Chebyshev or Butterworth, for example), since many 
such filters have undesirable overshoot characteristics (see Figure 11.11c). 
Notice that the term time constant (t) is meaningful only in connection with a 
single RC filter section and, even then, does not adequately convey a sense of 
the response time of the filter. With a voltage-step input, for example, such a 
single RC section requires about five time-constant intervals for its output to 
rise to within 1 % of its final value. 


11.7 AMPLITUDE-MODULATED SIGNALS; THE LOCK-IN 

AMPLIFIER 

Most measurement systems are troubled hy 1/f noise. By amplitude modulating 
the measurand (quantity to be measured) at some reference or carrier frequency, 
/r, the output noise can often be reduced and d.c. drift problems avoided (see 
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Figure 1 1 12) In optical systems, for example, rotating or vibrating mechanical 
chopper blades are often used to periodically block a light beam and thereby 
square>wave modulate the signal amplitude — even though, m most cases, 
such choppmg means losing half of the light (signal) Measunng instruments 
that respond only to the modulation provide auiomaitc background subtraction, 
as withdc systems, however, the noise component of the background remains 
Such modulation also allows the use of transformers to noise>match preampli- 
fiers to loft resistance sources 

As with baseband signals and low-pass filtering, the SNR of a noisy ampli 
tude-modulated signal can be improved by bandwidth reduction — m this case a 
band-pass filter is commonly used In most applications, earner frequencies 
are chosen from the 100 Hz-10 kHz range, where preamplifier and environ- 
mental noise IS lowest, care should also be taken to avoid frequencies occupied 
by harmonics of the power-line frequency A second-order band pass filter 
(see Figure 11 13) is specified by its resonant or centre frequency,/,, and its 
selectivity, Q (quality factor) For a given value of/,, the higher the Q, the 
narrower the filter width 

The — 3 dB frequencies are at / + ^ and signal bandwidth (BJ is defined 
by 

For a second-order band-pass, the signal bandwidth and the equivalent 
noise bandwidth (B„) are related by 


so that, 




(1H5) 
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The band pass filter has associated with it an cfTective time constant, r, where 
(as with the low-pass filter case discussed m Section 112) 


so that, 

Br, = = i7r(2/2Jir) = I/2 t 

Also, from equations (11 I4) and (II 16) 

T = \flnj, = QM, 


From equation (11 15), we can see that the higher the Q, the smaller the noise 
bandwidth and, therefore, for white noise or other broad-band noise inter- 
ference, the smaller the noise and the better the SNR With a band-pass filter 
implemented by active RC (or LC) circuitry, frequency-stability problems limit 
the maximum practicable value of Q to about 100 
The lock in amplifier (EG & G, 1) is in part, a band-pass filter-amplifier 
that overcomes the Q limitations of conventional circuits, noise bandwidths 
of less than 0001 Hz and Q values of 10® or greater are easy to implement 
The lock-m amplifier can also provide amplification of more than 10® (180 dB) 
The term lock’in comes from the fact that the instrument locks m to the frequency 
(/,) of a reference signal With an external reference signal, a lock-in acts as a 
tracking band-pass filter and detector with a centre frequency equal to that of 
the reference (/,), it will automatically track changes in/, and can be used m a 
frequency-scanning mode if desired Commercial instruments are available to 
cover a frequency range of about 0 1 Hz- 50 MHz 
Though not all lock-m amplifiers use a phase-locked loop in their reference 
channel, most single-phase lock-ins may be represented by the simplified 
block diagram shown in Figure 11 14 TTie reference input waveform to the 



Figure 11 14 Basic lock-m amplifier (simplified) (reproduced by permission ofEG&G 
Princelon Applied Research Corporation 




SIGNAL-TO-NOISE RATIO IMPROVEMENT 


451 


lock-in may be of almost any waveshape and its zero crossings are used to 
define zero phase = 0). The output of the phase-locked loop circuit is a 
precise square-wave, locked in phase to the reference input, and at a frequency 
/j. Normally, /a = /r (the reference frequency); most lock-ins also provide a 
second-harmonic mode where /a = 2f^ and this mode is often used for derivative 
(signal rate of change) measurements. 

All lock-ins use a phase-sensitive detector (PSD) circuit and all PSD circuits 
consist of nothing more than a mixer followed by a low-pass filter. The output 
of a mixer ( 63 ) is the product of its signal (cj) and gating (ca) inputs, that is, 
63 = cjCa; the phase difference between these two inputs can be precisely 
adjusted by the phase-shifter circuit in the reference channel. For use in lock-in 
amplifiers, mixer circuits must be capable of withstanding large amounts of 
noise (i.e. asynchronous signals, /j /a) without overloading. The term 
dynamic reserve is used to specify such noise overload performance (see Figure 
11.15). Dynamic reserve is defined as the ratio of the overload level (peak 
value of an asynchronous signal that will just cause significant non-linearity), 
to the peak value of a full-scale synchronous signal. Dynamic reserve is often 
confused with dynamic range, which is the ratio of the overload level to the 
minimum detectable signal level. 

The d.c. drift of both the mixer and d.c. amplifier may limit the minimum 
detectable signal and the gain of the d.c. amplifier should, therefore, be mini- 
mized to provide optimum output stability; a.c. gain should be used to provide 
most of the overall instrument gain required. Such a gain distribution is pract- 
icable and desirable for use with clean signals. With noisy signals, however, 
the a.c. gain must be reduced to provide increased dynamic reserve, and the 
d.c. gain increased proportionately. Most high-performance instruments 
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Figure 11.15 Dynamic range and dynamic reserve of a lock-in amplifier 
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Figure 1 1 16 Switching mixer waveforms (reproduced by permission 
of EG5.G Princeton Applied Research Corporation) 

provide the controls to allow such a trade-off between dynamic reserve and 
output stability 

Due to non-linearity and other problems, a linear multiplier type of mixer 
cannot provide the dynamic range required in a (commercial) lock-m amplifier 
Consequently, mixers arc invariably of the switching type, shown in Figure 
11.16. The switch shown in this figure will be in position A during positive 
half-cycles of the square-wave dnve waveform and m position B during negative 
half-cycles. When the signal and dnve waveforms have a common frequency 
component, as shown, the mixer acts as a synchronous rectifier and produces 
a phase-scnsitne dc. output. Outputs arc shown for four different phase 
relationships Notice that the mixer dc. output can be adjusted from zero to 
±(2/7t)£, by varying the phasc-dificrcnce (0| — i^j) The square-wave dnve 
has an efiectivc amplitude of ± I and contains all odd harmonics of the fun- 
damental frequency of the square-wave. The output of a mixer, therefore, is 
composed of a large number of frequencies (see Figure 11.17) Thus, /i + 
/j ./i + 3/2 ,/i + 5 / 2 , . . , are sum frequencies - /j ,/, - 3/2 ,/i - 5 / 2 , • • . . 
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Figure 11.17 Switching mixer operation (reproduced by permission of EG&G Princeton 

Applied Research Corporation) 


are difference frequencies. Note that when/j — (2« + l)/ 2 > of the 
ence-frequency exponents of the mixer output will be at 
or d.c.. A mixer will, therefore, produce a phase-sensitive 
/, = (2ii + l )/2 and for n > 0, these outputs have magnitudes that are nversely 

proportional to their harmonic number (n) and are 

responses of the mixer. Lock-ins respond to the average fu -wa 

value of the input signal but are usually calibrated in r.m.s.-a sinusoidal input 

or sinusoidal front-end response is assumed. 

The PSD input (cj) need not be sinusoidal. If ej were a ^ 

wave signal, for example, such as that resulting from choppe - ig 
ments, then ej would contain a large number of synchronous compon 
each of which would give rise to an output d.c. signal from t e . 

In a perfect mixer, only synchronous inputs can cause a d.c. output, n prac 
due to non-linearities of the mixer switching elements, a mixer can pro uc 
d.c. output with high-level noise inputs; even with no (zero) input, capaci i\ 
feedthrough can cause a d.c. output. Such spurious d.c. outpi^s , 

negligible in amplitude. However, at higher frequencies (above 10 kHz typically;, 
the magnitude of such a d.c. offset and its associated drift may become signiUcant. 

As we saw previously in Figure 11.17, the output of a mixer conteins a arge 
number of sinusoidal sum and difference frequency components. ( e num 
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Figure 11 18 PSD operation with synchronous signal (reproduced by permission of 
EG&G Princeton Applied Research Corporation) 


IS large rather than infinite, since the squareness of the drive signal is not perfect, 
and the drive signal does not contain all higher odd harmonics) The effect of 
the low-pass filter (see Figure U 18 ) is to remove all components of the mixer 
output which have frequencies beyond the filter cut-off When the filter time 
constant is set normally (so that the filter cut-off frequency (/j) is appreciably 
less than the fundamental frequency (fi) of the mixer square wave drive), 
the output of a PSD will contain only those difference frequency components 
having frequencies within (approximately) the equivalent noise bandwidth 
of the filter 

Suppose, as shown m Figure II 19 , that the PSD input (gj) is asynchronous 
(noise) of frequency /i = /j + A/ The resulting mixer sum and difference 
frequencies (ignoring harmonics for simplicity) will, therefore, be 2/2 + A/ 
and A/ respectively Only the A/ component may be low enough in frequency 
to pass through the low-pass filter and appear as output noise Suppose we 
change the frequency of this input noise to fi ~ fz ~ A/ The resulting sum and 
difference frequencies will respectively be 2/2 — A/ and — A/(= A/) Again, 
only the A/ component can appear as output noise and the low-pass filter 
‘cannot tell’ whether its A/ input resulted from an /2 -f Af input to the mixer oran 


LOW-PftSS 
f, -t* IHz FILTER(LPF) 

f. t,-Af 99 Hj uiyco — — — 



Figure 11 19 PSD operation with asynchronous (noisy) signal (repro 
duccd by permission of EG&G Princeton Applied Research Corporation) 
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— A/ input. In addition to its rectifying and phase-sensitive properties, 
therefore, the PSD filters noise as though it consisted of band-pass responses 
centered on all odd harmonics of /j (see Figure 11.20). Notice that each effective 
band-pass response consists of the output low-pass filter response and its 
mirror image; their centre-frequencies automatically track changes in the 
PSD drive frequency /j . 

Each band-pass response has an equivalent noise bandwidth determined 
by that of the low-pass filter. If the PSD input consists of white noise, the effect 
of the harmonic responses (2n + 1 = 3, 5,1 ,.. .) is to increase the PSD output 
noise by 11%. For square-wave signal inputs, the additional output noise 
(1 1 %) caused by the harmonic responses is more than compensated for by the 
increase in signal (23 %). If the PSD is used to measure a sinusoidal signal 
accompanied by white noise, a separate low-pass or band-pass filter, centred 
on/ 2 , inay be used in front of the PSD to remove the harmonic responses 
and thus the additional 1 1 % noise. The improvement in output SNR effected 
by the use of such front-end filtering on white noise is normally insignificant. 
With discrete frequency noise, however, front-end filters can be extremely 
helpful. By reducing the input noise before it reaches the PSD, the overload 


GAIN 



Figure 11.20 Frequency response of a PSD (reproduced 
by permission of EG&G Princeton Applied Research 
Corporation) 
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capability of the lock-in may be improved significantly beyond the dynamic 
reserve of the mixer and this additional dynamic range can also be used to 
provide increased output stability if desired For this reason some lock-ins 
provide adjustable low pass, high-pass, band-pass and band-reject (notch) 
filters, m addition to a flat frequency response (broad-band) mode Applications 
and explanation of operation of PSD sj^tems are also reviewed m Blair and 
Sydenham (1975) 

Heterodyning front-ends are also available With this approach, a fixed- 
frequency band-pass amplifier is used to protect the PSD and increase the 
overload capability of the lock-m In order to use such a fixed-frequency 
filter, another mixer is used to heterodyne the input signal frequency up to the 
centre frequency of the filter, two such heterodyning schemes are shown m 
Figure 1121 The advantage of this approach is that the instrument offers 
sinusoidal response (no harmonic responses) and overload performance 
approaching that of a tunable band-pass instrument, without the need for 
manual tuning Because of the phase-shift characteristic (see Figure 11 13) 
of their front-end band pass filters, however, manually tuned or heterodyning 
mstruments cannot provide the phase stability of a broad-band or flat lock-m 

Figure 11 22 shows a simplified block diagram of a mo phase or vector lock-in 
amplifier An additional quadrature (0 output channel has been added, 
consisting of a mixer, low-pass filter, and d c amplifier The reference channel 
provides quadrature gating inputs to the I (in-phase) and Q mixers The ortho 
gomlity of the two mixer dnves— that is, the accuracy of the 90® phase difference 
between them— is extremely important when measuring a small 1 signal m 
the presence of a large Q signal (or vice versa) Similarly, m a single-phase 
lock-m, the accuracy with which a 90® phase shift can be switched into or out 
of circuit (usmg the phase-quadrant switch), is equally important 

Most two-phase lock-ms provide a veclor/phase circuit (usually as an 
option), which computes the vector magnitude M, where 

M = + Q^) = JliA cos if>)^ + {A sm ^)=] = A 


where A is the signal amplitude, and 


0 = tan ‘ I 


= tan 


/ A sin(^, + 0,) \ 
^/lcos(^, + (/>,)/ 


+ <Pt 


where (f>, is the signal phase shift relative to the reference signal phase, and <j>, 
is the phase offset set by the phase shift controls Two-phase lock ms can 
therefore, display their output signal in rectangular or polar form, with the 
phase controls (<^,) allowing continuous wetor rotation Notice that asyn- 
chronous signals (/, 7^ /,) with beat frequencies within the low-pass filter 
response will provide d c outputs and the instrument, therefore, acts as a ivaie 
analyser Modem wave analysers are essentially vector lock-ms that are 
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IN PHASCCl) OUTPUT 



OUAOnATUftE (0} OUTPUT 


Figure 1 1 22 The two phase (vector) lock in amplifier 

optimized for convenience m measuring frequency components of a signal 
rather than recovering a signal from noise 

Figure U 23 shows a typical application for a two-phase lock-in In such a c 
bridge applications, the phase shift (<p,) can be set to zero, so that the m-pbase 
(/) signal responds only to the bridge resistance and the quadrature output 
(g) to the bndge capaatance The bridge <an then be balanced very simply 
by separately nulling K, and C, 



lock m amplifier (a) system arrangemrat, (b) vector relationships 
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As in the case of d.c. measurements with preamplifiers or electrometers, the 
SNIR to be expected from a lock-in depends upon the input noise bandwidth 
(B„i) to the lock-in, the noise bandwidth of the lock-in, and the noise 
spectral characteristics. For random white noise and unity gain, 

SNIR = 

where for a —6 dB/octave rate of roll-off for the lock-in output low-pass filter, 
Bno = 1/4RC, or for a — 12 dB/octave low-pass filter roll-off, = 1/SRC. 


11.8 SIGNAL AVERAGING 


1 1 .8.1 The Boxcar Averager 

The boxcar averager (EG & G, 2) is a sampling instrument that is used to enhance 
repetitive signals. Also known as a boxcar integrator or detector, the boxcar 
takes only one sample during each signal occurrence or sweep, and requires a 
trigger signal at a fixed time interval prior to the beginning of each such sweep. 
The heart of any boxcar is the gated integrator circuit, shown in simplified 
form in Figure 11.24. This circuit is simply an RC low-pass filter gated by 
switch Si (the sampling gate). As shown, the gated integrator has unity d.c. 
signal gain. 

If the gate is opened (i.e. Si closed) every T seconds for an aperture of tg 
seconds, then the duty factor y is given by y = tg/T = tgf where / = l/T. 
When C; is a voltage step, will rise exponentially as shown in Figure 11.25, 
curve A. Notice that the effective time constant is much longer than the 
real (ungated) time constant, RC, and is given by 



RC 

u 


(11.19) 


As we saw previously for the PSD of a lock-in amplifier, the noise bandwidth 
(Bno) of the gated integrator is simply that of the low pass filter, that is B„„ = 
1/4RC. 



Figure 1 1 .24 The gated integrator (simplified) 
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Fiffure 11 25 Gated integrator modes of operation curse A, Exponential 
averaging r ™ RC {»,/) curse B Linear summation 


For input uhite nois« limited toa bandvsidtb 8^ and unity gain, and where 

< l/if 

then 

SNIR = = JmCBJ (lUO) 

The time resolution of a boxcar measurement will improse (shorten) as the 
gate duration is decreased, until the point is reached where the input 
bandwidth (B„,) limits the resolution If we set 

r, = I/(2B„J or B„, = l/(2f^ 
then we obtain the widely quoted formula 

SmR = yJiARCBJ ^ ^(2RC/t^) (1121) 

With pulsed signals, the ability of a boxcar to separate temporally the signal 
from (most of the) noise is usually of much greater significance than such 
theoretical white noise considerations 

In the exponential aieraging or exponential weighting mode shown in Figure 
1 1 24, the output signal from the gated integrator favours the most recent samples 
and provides a dc output that follows the input at a reasonable rate In many 
wajs this mode of boxcar operation resembles a lock-m amplifier, in fact, if 
two gated integrator channels are used, one to sample signal plus background, 
the other set to sample the background only, then by takmg the difference 
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between the two outputs, we have essentially built a lock-in amplifier with 
adjustable duty cycle. 

Figure 11.26 shows a simplified schematic of a complete boxcar averager. 
When switch S 4 is moved to the A position, the circuit behaves as a true gated 
integrator rather than as a gated low-pass filter. In this mode, all samples have 
equal weight and, for a step input, the output will rise in a linear staircase 
fashion as shown in Figure 11.25, curve B. In this linear summation mode, the 
desired number of signal samples (m) is selected ; after m triggers have occurred, 
switch S 3 is used to reset the integrator (discharge capacitor C). Since the signal 
samples will add linearly, and random noise samples will add vectorially, 
after m samples of a constant amplitude signal (S) plus white noise (N), and 
after maximizing the gate width to suit the signal waveshape, the output SNR 
is given by: 


SNR„„, 


5i -l- 52 -b 53 -+-••• -t" fnS 

V(N? + Nl + Nl + --- + Ni)^ ^(mN^) 


so that 


-Vm = SNRi„> 


SNIR = 


SNR„„, 

SNRi„ 


/ SNR (m samples)\ 
V SNR (1 sample) / 


y/m 


( 11 . 22 ) 


Notice that for this operating mode, it is easiest to think in terms of time 
averaging since the equivalent noise bandwidth of the gated integrator circuit 
is not constant but will decrease with increasing m. 

The width (tg) of the gating pulse is set by means of the aperture-duration 
controls and circuit, and the delay between receiving a trigger and sampling 
the following sweep is adjusted by means of the aperture-delay circuit. If the 
aperture delay is set manually to a constant value, then the boxcar is in a 


SIGNAL 



Figure 1 1.26 The boxcar averager (simplified) 
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Figure 1 1 27 Boxar operation m scanning mode (rqsroduced by permission of 
EG&G Pnnceton Applied Research Corporation) 


stationary mode and v. ill sample the same portion of each successive signal— thus 
providmg an output corresponding to the amplitude of the signal at that point 
Alternatively in its scanning mode the aperture delay can be slowly and con 
tmuously changed by a voltage ramp from the scan ramp generator so that the 
sampling aperture is slowly moved across the entire signal (see Figure 11 27) 
In this mode the boxcar output is a replica of the signal waveform and the boxcar 
can be regarded as a time translation device that can slow down and recover 
fast waveforms 

In the scanning mode the aperture duration (r^ is not necessarily equal to the 
time resolution but rather sets the maximum resolution that can be achieved 
(assuming no input bandwidth limitation) if the scan is sufficiently slow For an 
amplitude resolution of withm I % of the full scale value a scan tune 7^ a signal 
(sweep) duration of T and a total effective boxcar time constant of Tb the time 
resolution Ir is given by 


r* — 5xgT/T^ 


(1U3) 


where 


Tb + T/) 

and Tf IS the time-constant of any additional filtenng used m the ir^trument 
Boxcar averagers can resolve very fast waveforms A 100 ps dual-channel 
boxcar averager using alternate signal sampling and baseline sampling in each 
channel is shown in Figure 11 28 Without its averagmg capability a boxcar is 
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Figure 11.28 Dual channel, 100 ps boxcar system (reproduced by permission of EG&G Applied Research Corporation) 
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Similar to a sampling oscilloscope and as shown m this figure, sample-and-hold 
plug-ms designed for use m such oscilloscopes can be used as fast front-ends for 
a boxcar 

11^.2 The Multipoint Signal A\cragcr 

The boxcar is a single'point averager it samples each signal occurrence (sweep) 
only once A nniliipoint aierager (Hewlett Packard, 1968) acts much like a large 
number of boxcars connected in parallel, since it samples many points (typically 
2‘° = 1024) during each signal sweep In such multipoint instruments, the 
analog storage capacitor of the boxcar is replaced by digital memory, each 
sample is digiti 2 ed and the new data are added to the data from previous sweeps 
already m the memory location corresponding to that sampling point 

Figure 1129 illustrates some typical waveforms and timing details for a 
multipoint averager, for simplicity / = 10 (je only ten samples/sweep are 
shown) The total signal duration (T) is given by the product of the number 
of samples/sweep (/) and the dwcll-time (gate width or sampling duration, t^) 
of each sample Notice that T is less than the total sweep duration (t) by the dead 
time (tj), and that there is usually a fixed delay time between receipt of a tngger 
pulse and the beginning of the first sample Although m most applications a 
multipoint averager is triggered at a constant rate (/ = 1/t), it is not necessary 
that the trigger be periodic Assume that the averager is set to continue averaging 
until m input sweeps have been sampled, at which point it will automatically 
stop 

Suppose we wish to recover the waveform of a noisy signal,/(f), where 
fit) = sit) + n(r) 

For the ith sample of the kth sweep, 

/(O = /(f* + if^ = sit, + If,) + n(f* + iQ (11 25) 

For any particular sample point (i), the input signal can be assumed to remain 
unchanged with each new value of A (i e with each new sweep) and the averaged 
signal will therefore be simply 

For random noise, samples (Jf,) will add vectorially, so that the r m s value 
(tr) of the averaged noise will be given by 

(I(W'" = (r> (1127) 

The averager output can be described by 


Qifk + = ms(if,) + 


(1128) 



SIGNAL . SIGNAL 

s(t) \ / + noise 
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SO that the output SNR is 


ms s , 

SNR.., =-i^ = —T- “t “ “ "■ 

(1129) 

The input SNR is simply 


SNR,„ = s(it,)/o 

(lUO) 

so that 



(1131) 

In order to consider a multipoint averager from a frequency domain or 
filtering point of view, we need to know us transfer function H(}co) We can 


determine H(juj) if we know the impulse response h{t), since //(joi) and h(t) are 
a Founer-transfonn pair 

We can determine /i(r) heunstically by the following reasoning In a multi- 
pomt a\ erager, tngger pulses are used to synchronize the signal sweeps and allow 
the signal samples to be coherently add^ (CO*ADDed) Mathematically, this 
action can be thought of as convolving the input signal, /(t), with a tram ofm 
unit impulses (triggers) spaced t seconds apart The a\ erager’s effective impulse 
response is, therefore, gnen by 


ft(0= (1132) 

By Founer transforming this expression for h{t), we find (Childers and 
Duilmg, 1975) that the averager’s transfer function is 


|/fOcu)|=: 


sin(ma)T/2) 

sin(o>T/2) 


(1133) 


Notice (from L’Hopital’s rule) that = m whenever at is an mtegral 
multiple of 271 Figure 11 30 shows the comb filter response of equation (11 33) 
forse%eral values <jfm Each band-pass response is centred atabarmoaic(n/c) 
of the sweep/tngger rate (If the tngger rate is apenodic, then this comb-filter 
concept becomes meaningless ) Since the peak transmission of each bandpass 
response is m, the — 3 dB points must occur at m/y/2, so that 


lH(jcu)| = 


sin(mQ>T/2) _ m 
sid((ot/2) ^2 


(1134) 


from which the — 3 dB bandwidth B for large values of m, is found to be 

B = 0886/(mT) (1135) 


Large vnlues of m are practicable, particularly at high sweep rates With a 
trigger rate (1 /t) of 100 Hz and m = 10® for example, the total measurement tune 
will beniT = 10® x 10“^ = 10* s Ji: 28 h, and B = 8 86 x 10“® Hz. 
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Thus far in this discussion of multipoint averagers, a linear summation mode 
of averaging has been assumed. That is, for the ith memory location, the average 
after m sweeps is given by 

(11.36) 

k=l k=l 

where 4 = fi.h + the value of the ith sample in the kth sweep. 

This algorithm has the advantage of being simple to implement digitally. The 
output averaged signal, however, continually increases with each new sweep; 
manual scale changing is required to keep the displayed output at a useful size, 
yet within the bounds of the CRT screen. A seemingly more convenient algorithm 
would be to normalize the data in memory after each sweep, that is, implement 

1 1 — A 

^k = = A-1 + (11.37) 

During each sweep, the data (A^-j) in each memory location are compared 
with the new sample value 4 and the computed value of (J^ — A*. i)/k is added 
to memory to form the new average value A*. Because of practical difficulties 
in implementing a division by k during or after each sweep, the algorithm shown 
in equation (11.37) is often approximated by 

A, - A,., + (11,38) 

where / is a positive integer selected automatically such that 2^ is the closest 
approximation to k. Notice for /c = 6 for example, that the closest I-' values are 
2^ = 4 or 2^ = 8. Though this normalized averaging mode is very slightly slower 
than the summation mode in enhancing the signal, we can assume that SNIR = 
for all practical purposes. Note that the discrepancy between k and 2-^ 
increases as larger values of J are used to deal with very noisy signals. In com- 
pensation, this averaging mode provides a stable, constant-amplitude display 
from which the noise appears to shrink with time. 

If we wish to recover and monitor slowly varying noisy signals, the algorithm 
of equation (11.38) can also be used for exponential averaging if J is made a 
manually selectable constant. When J = 0, then 2’' = 1 and A^ = I with this 
setting, the input signal may be monitored in real time, since it is digitized and 
stored without averaging. In general, selecting a value of J will establish an 
effective time constant, Xj, where 


or 


= 


- ln(l - 2-^ 


(11.39) 


2-^ = 1- exp(-yTj) 


(11.40) 
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Figure 11.30 The comb liltcr uclion of a multipoint 
averager 


The larger the value of J .selected, the greater the signal enhancement, and 
the more slowly the averager responds to changes in the input signal. For a large 
number of sweeps (EG & G. 3) the SNIR is given by 

Figure 11.31 shows the simplified block diagram of a multipoint averager. It 
is common to include a low-pass filter in the analog input channel with a — 3 dB 
cut-off frequency controlled by the dwell-time setting. A typical example 
might be 


/c l/(2fK) 

that is, one-half of the sampling frequency. Such filters arc used to improve the 
input SNR rather than as anti-aliasing filters. 
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Fieure 11 31 The mullipoinl signal averager (simplified) 


The maximum number of sweeps (m,„^ that can be digitized by an averager 
without data o\ erflow, if the input signal is full scale and noise-free, is gi\ en by 
2* * where h is the memory size (bits) and c (bits) is the resolution of the A/D 
converter (ADC) For a 9-bit (8-bit -i- sign) ADC and a memory of N words, 
each of 28 bits, then 

= 2^-* = = 2’’ = 524,288 

In some instruments with artifachrejectton capability, each new signal 
sweep IS digitized and placed m a bufler memory Before addmg the buffer 
contents to mam memory, each buffer-memory location is checked for overflow, 
the buSer contents are discarded should an overflow (artifact) exist 
Suppose that the input SNR to an averager is 1 10, that is, the r m s noise 
(ff) IS ten times larger than the peak signal (S) The a c gam before the ADC, 
must be set such that the noise peaks do not exceed full scale For Gaussian 
noise, It is 99 9% probable that the peak noise (Np) amplitude is less than five 
timesgreaterthan therms noise amplitude, that IS, Np/ff ^ 5,sothatWp/S $ 50 
Assume that the mput gam is set such that Np is just equal (say) to the full 
scale mput level of a 9 bit ADC Assume ako that the resolution of the ADC is 
2^ (= 512), and the memory size h = 2**, as before Of these 9 bits, 6 bits 
( = 2® = 64) will be required as dynamic reserve (i e to handle the mput noise), 

then, the maximum number of sweeps before overflow would be 

= 2“ == 34 X 10’ 

Under the conditions of this example, the output (vertical) resolution will 
not be lumted to 3 bits Random noise accompany’ng the signal will dither the 
ADC, that IS, the noise will modulate the quantization levels of the ADC so as 
to provide a resolution that increase as m increases Note, however, that with 
out noise and with the same full-scale settmg, the averager output would 
mdeed have a 3-bit amplitude resolution White noise can be added deli- 
berately to signals that are clean, in order to improve resolution beyond that 
of the ADC (Horhck, 1975) 
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It is useful to compare boxcar and multipoint averagers. For dwell times of 
about 1 /IS or longer, the multipoint averager typically needs less than one- 
thousandth of the measurement time needed by a boxcar to recover a wave- 
form; on the other hand, the boxcar is .the only choice for gate widths (dwell 
times) of 1 ns or less. For dwell times in the 1 ns-1 /is range, the choice is between 
a boxcar or a transient recorder interfaced to a multipoint averager. Such 
transient recorder-averager combinations are usually less time-efficient 
(i.e. T §> /tg) than a multipoint averager alone, due to slow data transfer. 

11.9 CORRELATION 

For our purposes in this chapter, correlation analysis is a method of detecting 
any similarity between two time-varying signals (Honeywell). Autocorrelation 
consists of the point-by-point multiplication of a waveform by a delayed or 
time-shifted version of itself, followed by an integration or summation process. 
Mathematically, the autocorrelation function, Rxx(t), of a time-varying function, 
f(t), is given by 

R,,(t) = hm ^ fit) fit + T) dt (11.42) 

where t is the lag value or time shift between the two versions of/(f). 

Cross-correlation involves two waveforms and consists of the multiplication 
of one waveform, /(t), by a time-shifted version of a second waveform, git), 
followed by integration or summation. The cross-correlation function, Rx/r), 
is given by 

Rxyi^:) = lim f fit)git + t) dt (11.43) 

Notice that cross-correlation requires two input signals, as is also true in the 
case of signal averaging (where a synchronizing input is required in addition 
to the signal input). Also as with an averager, a cross-correlator preserves 
signal phase information. Unlike the averager, however, the cross-correlator 
output waveform, the correlogram, is affected by the waveform of the second 
input signal — an undesirable and unnecessary complication for signal recovery 
applications since a multipoint averager may be used. Cross-correlators are 
normally used in flow or velocity measurements; they are used but rarely for 
simple signal-recovery purposes. (Ignoring the lock-in amplifier and boxcar 
integrator, both of which can be regarded as a special type of cross-correlator.) 

Phase information is lost in an autocorrelation function, as is also true for 
its Fourier transform, power spectral density. This lack of phase information 
means that in some cases, the input signal responsible for a given correlogram 
must be deduced by intelligent guesswork. For example, as shown in Figure 
11.32, the autocorrelation function for band-limited white Gaussian noise is a 
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noise 


spike-bke peak at t = 0, with a width that would decrease as the ooise baud 
width increases A similar correlogram could have resulted from an input 
consisting of a single narrow pulse, the narrower the pulse width, the narrovier 
the correlogram spike Thus the pulse and the band^Iimited white noise have 
similar power-density spectra. (The difference between them is that frequency 
components of the noise have random phase relationships ) 

A simplified block diagram of an aulocorrelator is shown in Figure 11 33 
The ADC will digitize the input signal once every lag mterval or dwell tune, 
tg Each such A/D conversion will require a conversion time, fe» where 4 t. 
The output digital word from the ADC, corresponding to the latest sample, 
provides one input to the digital multiplier and also is shifted, as word 0, 
into an 7V-word shift register During this shift operation, the last word m the 
register, word (N — 1), is shifted out and discarded and the former word 
(N — 2) becomes the new word {N — 1) The control and timing circuits 






Figure 1 133 The autocorrelator (simplified) 
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then cause the shift register to step N times, recirculating its contents one full 
cycle, and requiring a time interval t^. During each shift-register word 
(i.e. (n — 1), (n — 2), (n — 3), . . . , 3, 2, 1, 0) is applied sequentially to the other 
input of the multiplier, which multiplies each of these words by the word 
at its other input, word 0. Each multiplier output is added to the contents of 
the corresponding bin of the iV-bin main memory; for instance, word 0 x word 
(n - 16) -* bin(n — 16), word 0 x word 0 -> bin 0. After many such cycles, 
the contents of bin i (for example) of the main memory, will be the sum of 
products formed by multiplying each new signal sample by an itg delayed 
version of itself. Bin 0, for example, corresponds to the signal multiplied by 
itself with zero delay. 

The minimum time between successive samples is + Q- When tg ^ 
(tj. + Q, the correlator is said to be working in a real-time mode. When ^ 
(tj + Q, samples can no longer be taken every tg seconds and the correlator 
is said to be in a pseudo-real-time mode. For tg (t^ + O, the correlator is in 
a batch mode. 

The process of autocorrelation involves the concept of sliding a waveform 
past a replica of itself. For random noise, the two waveforms will match at 
only one point as they slide by each other, that is at r = 0, when they are per- 
fectly aligned. In contrast, a square-wave sliding by another square-wave 
will find a perfect match once in every period and will give rise to a triangular 
correlogram. More generally, signals that are periodic in time will produce a 
correlation function that is periodic in t. Suppose, for example, that the input 
to an autocorrelator is f(t) = A cos((ot). Then 

1 

= lim — A cos{(ot)A cos{cot -h T)dt = jA^cos(a>r) (11.44) 

T-oo2T J_7- 

Figure 11.34 shows the correlation function of a sine wave accompanied by 
band-limited white noise. Note that the peak value at t = 0 in this correlogram 
is the mean-squared value of the signal plus noise (i.e., S -f N). The mean- 
squared value of the sinusoidal signal component is given by the peak value of 



Figure 1 1.34 Correlogram of a noise sinewave 
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the Sine wave at t values where the white noise spike has damped to zero The 
SNR of the correlator output can, therefore, be determined since 


S 

iS + N)-S 


^=SNR 


Most importantly, notice that the signal and noise components have been 
separated by virtue of their different positions on the r-axis It is this separating 
ability that makes correlation a powerful signal recovery technique 


11 10 PHOTON (PULSE) COUNTING TECHNIQUES 


11 10 1 Introduction 

PMT’s (photomultiplier tubes) are used to measure the intensity or flux of a 
beam of visible photons (see Figure 1 1 35) With its photo-cathode removed, 
the PMT becomes an EMT (electron multiplier tube) and is widely used to 
detect ions and electrons One of the most important advantages of such 
detectors is that their high gain and low noise allow them to give one output 
pulse for each detected input particle Since visible-light measurements are 
perhaps most commonplace, we will consider as an example a PMT and pulse- 
counting system of the type shown in Figure 1 1 36 
The probability of each incident photon causing an output pulse from the 
PMT IS essentially equal to the quantum efficiency, C typically C ts between 
5 25 % In addition to the photon derived pulses (i e the signal pulses), there 
will be spunous noise pulses at the PMT output due to thermionic emission 



Figure 1 1 35 End window photomultiplier tube 
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f'lpurc 1 1 .36 Typical photon counting system (reproduced by permission of EG ttG 
Princeton Applied research Corporation) 


Noi.se pulses caused by thermionic emission from the dynodes will experience 
less gain and will be smaller in amplitude than pulses due to cathode emission. 
Tlic PMT output pulses arc amplified and presented to a pulse-height dis- 
criminator circuit, where the peak amplitude of each pulse is compared to an 
adjustable threshold or reference voltage. Ideally, the discriminator will 
reject all dynode-dcrived noise pulses and accept all cathode-derived pulses; 
in practice, the PMT gain is statistical in nature and cathode and dynode- 
dcrived pulses have overlapping amplitude distributions. The discriminator 
will therefore accept most cathode-derived pulses and reject most dynode 
noi.se pulses. Each accepted input pulse will cause a standardized output 
pulse. Such pulse-height discrimination also reduces the effect of PMT gain 
variations with time and temperature. 

Ratemeters arc used to give a continuous analog output voltage which is 
proportional to the discriminator output pulse (count) rate. Alternatively, 
digital counter circuits can be used to accumulate output counts for a pre- 
.sclcctcd measurement lime. Such counters allow very long integration times 
and when a digital output is required, they can avoid the loss of resolution 
inherent in D/A conversion. 


11.10.2 Poisson Statistics, Shot Noise, and Dark Counl.s 

Suppo.se we use our PMT to detect photons emitted from a thermal light 
source such as a tungsten filament lamp. The time interval between successive 
photons impinging upon the PMT photocathodc is random and governed by a 
Pi>i%\oi} distribution (sec Figure n.37a). TIic probability. P, of detecting n 
photons in a time r following the l.ist photon is given by 


Pin. I) 


CRiYc 




nl 


S ’e-^ 

’"nl 


(11.45) 







476 


HANDBOOK. OF NIEASUREMENT SCIENCE 



Figure ! 1 37 The Poisson dismbuiion (reproduced by permission of 
EG&G Princeton Applied Research Corporation) Curve A Probability 
of detecting n photons tn time f = (Rf)" 6xp(~Rt/n) R - 10* photons/s 
r s 10 ns and so Rf * I <t « -JiRi) Curve B Probability of gam 
magnitudePfx) * Af^e *V't' whcrex - dynodegain Af - meandynode 
gam - 5 and a - y/M 


where R is the mean photon rate (photons/second) and N *= ^Rt is the signal 
0 e the mean number of photoneleclrons emitted by the PMT photocathode 
durmg the time interval r) The noise, or uncertainly, in N is given by the 
standard deviation a, where 

a = yjm) = 

so that 


SNR=^ = VJV = 7(CRt) (1146) 

Notice that, as in all of the techniques examined in this chapter (with white 
noise), the SNR is again proportional to the square root of the measurement 
time (t) If we assume that there is no thermionic (dark) emission of electrons 
from the photocathode, then the photocathode (signal) current (m amperes) 
IS given by 


/pe = CRe 


(1147) 
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where e is the charge of an electron (=s 1.6 x 10 C). The signal-to-noise 

ratio (SNRk) of the photocathode current (/pe) is given by 

SNR, = V(CR<) . (11.48) 

Now the measurement time t has associated with it a frequency range A/, 
where 


t = l/2Af, 

so that 

SNR, = J = J {lellW) ^ ^(2ellAf) 

If we multiply both the numerator and denominator of equation (11.49) by the 
mean PMT gain A, then 


SNR, 


AI, 


pe 


A^(2eI,Af) V(2^e7,A/) 


SNR, 


(11.50) 


where is the d.c. anode current and SNRa is the signal-to-noise ratio of the 
anode current if thermionic emission and other dynode noise contributions 
are ignored. Note that the general expression for the shot noise of a d.c. current 
I is given by 


r.m.s. shot noise current = yJ{2AeIAf) (11.51) 

where A is the gain following the shot noise process. When A = 1, the expression 
simplifies to y/{2eIAf). Notice also that shot noise was present in the light 
beam itself and that the PMT quantum efficiency (C) degrades the SNR by a 
factor of 

In practice, with no input photons, the photocathode will emit electrons due 
to temperature effects. The dynodes will also emit thermionic electrons. The 
rate of such thermionic emission is reduced by cooling the PMT. Thermionic 
emission from the photocathode, that is, dark counts, can be further reduced by 
minimizing the cathode area and by selecting a photocathode material with 
no more red (long-wavelength) spectral response than is necessary. 

If the photocathode emits electrons randomly at a dark count rate R^, 
then the noise components of the cathode current will increase to + R^t) 

and the signal-to-noise ratio of the cathode current will degrade to 


SNR = 


CRt _ CR^t 

J(CRt -t- RdO V(C^ + ^d) 


(11.50) 


This will also be the PMT output SNR if dynode noise is assumed to be 
removed completely by pulse-height discrimination. For PMT’s equipped 
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With a high-gam first dynode with Poissonian statistics (see Figure 11 37b), 
this IS a not-unreasonable approximation Note that when a PMT is used in a 
non-counting or d c mode, as was discussed previously in Section 116, all of 
the output electrons resulting from spurious cathode emission (i e dark counts) 
and dynode emission are integrated by the anode or preamplifier time constant 
into a d c dark current, and the opportunity to remove dynode noise by pulse- 
height discrimination is lost 


11 10 3 Pulse-height Discrimination 

Each electron emitted by a PMT phoiocathode will be amplified by the in- 
stantaneous value of the PMT gam For a mean gam of 10^ for example, each 
cathode electron will cause an average output charge q of 10®e coulomb 
This charge q will accumulate at the anode during a time t given by the transit- 
time spread of the PMT Typically, t will be about 10 ns, so that the resulting 
an odecurrent pulse (i, = dq/dOwill have a full width (r J between half-maximum 
amplitude points (F WHM) of about 10 ns also The peak value, /p^, of the anode- 
cunent pulse may be approximated by assuming the pulse to be rectangular, 
80 that in our example. 


/ 


pk — 


tw 


10«e 10«xl6xl0-‘’ ,, . 

10 X 10~’“ 10x10'^ 


In a photon<ountuig system, the anode-load resistor (RJ of the PMT is 
kept small, usually 50 100 Q 'nierefore the time constant (t.) formed by the 
anode stray capacitance (C,) will be small compared to and thus will not 
stretch the anode voltage pulse Typically, R, = 50 n and C, =* 20 pF, so that 
T, “ 1 ns <1 1 ,, The anode voltage pulse will then have the same shape as the 
anode current pulse, and a peak value of 

Epk = /pkR, = 16 X 10"® X 50 = 08 mV 

It should be remembered that such pulse ampbtudes depend upon the 
FMT gam which, in turn, depends upon the dynode gams — which are statistical 
In the above example then, Ej,^ = 08 mV is the average pulse height to be 
expected Actual pulse heights will be distributed above and below this value 
and the better the PMT, the narrower will be this distribution A preamplifier 
IS normally used to amplify the anode pulses to a suitable level for the pulse- 
height discnminator 

Notice that the PMT gain, the preamplifier gam, and the discriminator 
threshold controls may all be used to adjust the effective discrimination level 
Figure 1138 shows a typical count-rate variation with PMT high voltage 
(i e PMT gam) and a fixed discrinunator threshold level This is one of a family 
of curves that could be plotted for different threshold levels, similar curves 
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Figure 11.38 The counting plateau (reproduced 
by permission of EG&G Princeton Applied Re- 
search Corporation) 


could be obtained by varying the preamplifier gain rather than that of the PMT, 
or by plotting count rate against discriminator threshold (Darland et al, 
1979). The upper curve in Figure 11.38 was plotted by allowing fight to fall 
upon the PMT photocathode and slowly varying the PMT voltage (which is 
non-linearly related to the PMT gain). Notice that the steep slope at low PMT 
voltages begins to flatten and form a (not-quite-horizontal) plateau as proper 
focusing takes place in the PMT. The increasing slope at very high PMT 
bias voltages is due to increasing instability in the PMT. 

The upper curve corresponds to S + N, since it is based on both signal and 
noise pulses. The lower curve was plotted with the PMT in darkness, and 
therefore represents noise (TV) pulses only. Notice that typically the dark- 
count curve has no plateau; it has been suggested that the lack of a plateau 
is due to corona effects associated with microscopic protrusions from the 
dynode surfaces. Since the count rate is plotted on a logarithmic scale, the 
vertical distance between the two curves corresponds to 

log(S + N)- log(TV) = log^- = log^l 

A commonly used method of setting the PMT high voltage for a given 
preamplifier gain and discriminator threshold level is, therefore, to select a 
point on the beginning of the counting plateau corresponding to maximum 
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11.10 4 Ratcmetcrs and Counters 

A simplified ratemeter arcuit is shown in Figure 1 1 39 Each output pulse from 
the discriminator results in a precise current pulse being averaged by the low- 
pass filler The ratemeter output voltage is, therefore, proportional to the 
av crage v alue of the discriminator output count rate (R,,,) 

A digital alternative to the ratemeter. a timer-counter circuit, is shown in 
simplified form in Figure 1 1 40 Both counters, A and C, are started and stopped 
together Counter C is a presettable counter a number N is preset, usually by 
means of thumbw heel sw itches, and the counter will stop when its accumulated 
count equals N When dnven from an interna! clock oscillator, this arrange- 
ment IS called a rimer circuit In the normal mode, a 1 MHz internal clock is 
often used so that the timer is used to set the measurement time t = ps 

The output count will be simpl) A = tR, , 

The ratio mode is usually used for source compensation where the signal 
count rate is proportional to both the measurand and (say) the intensity of a 
light source By monitoring the light source with a separate PMT and amplifier- 
discnminator to produce a source-dependent count rate R,e, R,,, can be 
normalized by R„ to provide an output count that is independent of source 
fluctuations 

In the reciprocal mode, the system will measure the lime (r, m fis) required for 
the cumulative signal counts to reach N, the smaller the signal count rate, the 
longer the elapsed time If the dark count rate is negligible, then the measure- 
ment accuracy is 1/SNR * 1/^(R,,0 and foraconstant value of Af), 

all measurements will have the same SNR and accuracy 

A s)iicbrQnous counting sxswn that acts like a diguol lock in amplifier to 
provide automatic background subtraction is shown in Figure 11 41 When the 
chopper blade blocks the input light, the output pulses from the araplifier- 
discnminator are, by definition, background noise, these pulses (JV) are gated 
into counter B by the timing circuit— which is itself synchronized by the 
chopper reference signal ^Vhcn the chopper blade allows light to reach the 
PMT, the discriminator output consists of signal plus-background pulses 



Ficure IIJ9 The ralcmeicr (simplilicd) (reproduced by permission of EG&G 
Princeton Applied Research Corporation) 
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Figure 11.40 Counter-timer modes of operation. (1) Normal mode; 

R, = A = cc R„g. (2) Ratio mode; = Rs.g, Rc = -Rs.- 4 = NR,JR^^ 

cc RsJRsc- (3) Reciprocal mode; = ^cik, Rc = R,,s, 4 = NR^JR,,^ cc 1/R„g. 


(S + JV) and these pulses are gated into counter A. After each measurement 
interval, an arithmetic circuit provides two outputs 

A-B = (S + N)-N = S = signa.\ (11.52) 

and 

A + B=^(S + N) + N- total counts (11.53) 

where A and B are the numbers of counts in counters A and B respectively. 
For Poissonian noise, 


SNR = 


signal A — B 

.^(total counts) ^(A + B) 


(11.54) 


Suppose, for example, that A = 10® counts and B = 9.99 x 10® counts then 
5 = A — B = 10® counts and ^/(A + B) = 1.41 x 10®, so that 


SNR = 


A-B 10® 

7(A + B)~ 1.41 X 10® 


0.71 


and (in)accuracy 


SNR 0.71 

or, expressed in words, the measurement is worthless ! The A + B output is 
important since it allows the measurement accuracy to be estimated in this 
way. 
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n. 10.5 Pulse Pile-up 

The dynamic range of photon-counting measurements is limited at low light 
levels by PMT dark count and, at high light levels, by pulse pile-up in the 
PMT or electronics. As the mean rate (R) of photons arriving at a PMT photo- 
cathode increases, then so does the probability of two or more photons arriving 
with too short an interval between them to be resolved by the PMT. 

The time-resolution of a PMT is effectively equal to its output pulse width 
t„,, and each output pulse from a PMT, therefore, occurs whenever an electron 
is emitted after a time greater than following the previous electron. The 
probability of this happening is the same as that for zero photoelectron events 
in a time (we can neglect dark counts at high light levels). As shown in Figure 
11.37a and from equation (11.45) then 


P(0, O = exp(-CRfJ (11.55) 

and the output count rate, R^, is given by 

R, = R(0, tJRi = CR exp(-CRO = exp(-RiO (11.56) 
The resulting PMT pulse pile-up error is given by 

Ri-Ro Rj - Riexp(-RifJ 


^pmt 


R: 


Ri 


= 1 -exp(-CRtw) (11.57) 


The PMT is a paralysable detector; that is, when the input count rate exceeds 
a certain value (Rj = l/t„), the output count rate will begin to decrease for an 
increasing input count rate and will become zero when the PMT is completely 
paralysed (saturated) (See Figure 11.42). 



Figure 1 1.42 Counting Error due to pulse piie-up (reproduced by 
permission of EG&G Princeton Applied Research Corporation) 
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Discnmmators and counters, on the other hand, are usually non-paralysable 
Suppose a discnmmator, for example, has a pulse-pair resolution or dead tune 
That IS, each time it accepts an input pulse, it cannot accept a new pulse 
until after a time r,, Then for a measurement time t, an input pulse rate of K, 
and an output rate R„, the total number of output pulses, N^, is given by 

No = 


and 


the total dead time = = RotU 


so that 


total hie time = f — Rgtti 

The total number of input pulses accepted is therefore given by 


= Rj = R.O - Rjti) 

so that 

R 

1 + Kitj 

A modem, fast discriminator and counter have a dead time, fd* about 
10 ns — similar to the lime resolution, r„ of a reasonably fast PMT Notice, 
hoNNe\er, that any PMT pile-up will act as a prefiUer to the discnmmator, 
that is, such pile>up will decrease the input count rate to the discriminator 
PMT pile up usually provides the upper limn to the system dynamic range 
Photon counting systems cannot be used in pulsed light measurements where 
the peak photon rale (during the light pulse) will cause unacceptably high 
pulse pile-up errors 


11.11 FINAL COMMENTS 

Many of the signal-recovery considerations discussed in this chapter, such as 
instrument selection, are summarized in Figure 1 1 43 Notice that this figure 
indiudestw o instruments not mentioned m the preceding sections ol this chapter 
the multichannel analyser (MCA), and the photon (digital) correlator The 
choice of signal-recovery instrument is often limited to that which is available 
and, in some applications, the MCA can be used in place of a multipoint 
averager Similarly, a photon correlator, if available, may possibly be sub- 
stituted for an analog correlator 

In Its multichannel scaling (MCS) mode, the MCA (Nicholson, 1974) 
consists effectively of a scaler (counter) connected to a digital memory much 
like that of a multipoint aierager During each sweep, the scaler sequentially 
counts the number of input pulses dunng each dwell time and adds that number 
to the cumulative count in the correspondmg memory address By using a 
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\oltage-to-frequency converter (VFQ ahead of an MCA in MCS mode, 
analog signals can be tune-averaged, and the VFC/MCS combination is 
essentially a multipoint averaging system The MCA can also be used in a 
pulse-height analysis mode, where the araphtude of each mput pulse is digitized 
and used to self-address the memory In other words, each mpui pulse with an 
amplitude between 63 85 % and 63 95% (say) of full-scale, will add one count to 
memory address No 639 In this way, a pulse-height distnbution, or spectrum, 
is built up Another common MCA measurement technique is to precede the 
MCA by a time-to-amplitude converter (TAC), so that each input pulse to be 
digitized corresponds to a tune mterval Low-level measurements of short 
fluorescent lifetimes, for example, may be made in this fashion 

The photon correlator (Cumuuns and Pike, 1974) is similar m many ways to 
the analog-mput autocorrelator descnbed m Section 11 9 The input signal is 
in the form of pulses, from an amplifier-discnmmator, and data processmg is 
in senal, rather than parallel, form — with counters replacing the analog cor- 
relator’s memory Such digital correlators usually provide at least one mode of 
clipped operation where, for example, n or more input pulses in a lag time (r) 
may correspond to a one, and less than « pulses correspond to a zero Such 
clipped operation can allow very fast binary shifting and multiplication 

The flowchart of Figure 1 1 43 makes no attempt to mclude all instruments or 
s}stems A lock-in amplifier is often used in front of a multipoint averager in 
order to reduce l/f noise problems Similarly, a boxcar/multipomt averager 
combination can offer the picosecond or nanosecond time resolution of the 
boxcar — without the need to scan so slowly that the system being measured 
may change during a scan (sweep) 

A last comment the object m signal recovery is not to maximize the SNIR 
but to minimize the measurement time required to reach a particular output 
SNR Similarly, m sclectmg a preamplifier, the real object is to minimize noise, 
not noise figure The noise and/or bandwidth of the signal source or transducer 
should, therefore, be minimized before seeking mstrumentational means of 
further SNR improvement 
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Chapter 



E. LZUCH 


Signal Data Conversion 


Editorial introduction 

The majority of measurands are of analog form. Actuators and other output devices exist 
in both analog and digital form. Processing of signal information is often best performed 
using digital formats because of the increasing applicability of binary electronic circuitry. 
Interfacing analog and digital stages is, therefore, a most important part of measurement 
systems design. This chapter provides the basis of signal domain conversion. 


12.1 DATA ACQUISITION SYSTEMS 
12,1.1 Introduction 

Data acquisition and conversion systems interface between the real world 
of physical parameters, which are analog, and the artificial world of digital 
computation and control. With current emphasis on digital systems, the inter- 
facing function has become an important one; digital systems are used widely 
because complex circuits are low cost, accurate, and relatively simple to imple- 
ment. In addition, there is rapid growth in use of minicomputers and micro- 
computers to perform difficult digital control and measurement functions. 

Computerized feedback control systems are used in many different industries 
today in order to achieve greater productivity in our modern industrial societies. 
Industries which presently employ such automatic systems include steel making, 
food processing, paper production, oil refining, chemical manufacturing, 
textile production, and cement manufacturing. 

The devices which perform the interfacing function between analog and 
digital worlds are analog-to-digital (A/D) and digital-to-analog (D/A) con- 
verters, which together are known as data converters. Some of the specific 
applications in which data converters are used include data telemetry systems. 
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pulse code modulated communications, automatic test systems, computer 
display systems, video signal processing systems, data logging systems, and 
sampled data control systems In addition, every laboratory digital multi- 
meter or digital panel meter contains an A/D converter 


12 1.2 Basic Data Acquisition System 

Besides A/D and D/A converters, data acquisition and distribution systems 
may employ one or more of the following circuit functions 

(a) transducers, 

(b) amplifiers, 

(c) filters, 

(d) non-linear analog functions . 

(e) analog multiplexers , 

(f) sample-holds 

The interconnection of these components is shown in the diagram of the data 
acquisition portion of a computerized feedback control system in Figure 12 1 
The input to the system is a physical parameter such as temperature, pressure, 
flow, acceleration, and position, which are analog quantities The parameter is 
first con%erted mto an electrical signal by means of a transducer, once in 
electrical form, all further processing is done by electronic circuits 
Next, an amplifier boosts the ampbtude of the transducer output signal to a 
useful level for further processing Transducer outputs may be microvolt or 
millivolt level signals which are then ampbfied to 1 to 10 volt levels Further- 
more, the transducer output may be a high impedance signal, a differential 
signal with common-mode noise, a current output, a signal superimposed on a 
high voltage, or a combination of these TTie amplifier, m order to convert such 
signals into a high-level voltage, may be one of several specialized types 
The amplifier is frequently followed by a low-pass active filter which reduces 



Figure 12 1 Data acquisition system 
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high-frequency signal components, unwanted electrical interference noise, or 
electronic noise from the signal. The amplifier is sometimes also followed by a 
special non-linear analog function circuit which performs a non-linear operation 
on the high-level signal. Such operations include squaring, multiplication, 
division, r.m.s. conversion, log conversion or linearization. 

The processed analog signal next goes to an analog multiplexer which switches 
sequentially between a number of different analog input channels. Each input 
is in turn connected to the output of the multiplexer for a specified period of time 
by the multiplexer switch. During this connection time a sample-hold circuit 
acquires the signal voltage and then holds its value while an A/D converter 
converts the value into digital form. The resultant digital word goes to a com- 
puter data bus or to the input of a digital circuit. 

Thus the analog multiplexer, together with the sample-hold, time shares the 
A/D converter with a number of analog input channels. The timing and control 
of the complete data acquisition system is done by a digital circuit called a 
programmer-sequencer, which in turn is under control of the computer. In 
some cases the computer itself may control the entire data acquisition system. 

While this is perhaps the most commonly used data acquisition system 
configuration, there are alternative ones. Instead of multiplexing high-level 
signals, low-level multiplexing is sometimes used with the amplifier following 
the multiplexer. In such cases just one amplifier is required, but its gain may 
have to be changed from one channel to the next during multiplexing. Another 
method is to amplify and convert the signal into digital form at the transducer 
location and send the digital information in serial form to the computer. Here 
the digital data must be converted to parallel form and then multiplexed onto 
the computer data bus. 

12.1.3 Basic Data Distribution System 

The data distribution portion of a feedback control system, illustrated in 
Figure 12.2, is the reverse of the data acquisition system. The computer, based 
on the inputs of the data acquisition system, must close the loop on a process 
and control it by means of output control functions. These control outputs 
are in digital form and must, therefore, be converted into analog form in order 
to drive the process. The conversion is accomplished by a series of D/A con- 
verters as shown (also often called DAC’s). Each D/A converter is coupled to 
the computer data bus by means of a register which stores the digital word 
until the next update. The registers are activated sequentially by a decoder 
and control circuit which is under computer control. 

The D/A converter outputs then drive actuators which directly control the 
various process parameters such as temperature, pressure, and flow. Thus the 
loop is closed on the process and the result is a complete automatic process 
control system under computer control. 




Figure 122 Data disinbuuon s)steTn 
12.2 QUANTIZING THEORY 


12.2.1 Introduction 

A/D conversion m its basic conceptual form is a two*step process quantizing 
and coding Quantizing is the process of transforming a continuous analog 
signal into a set of discrete output slates Coding is the process of assigning a 
digital code v, ord to each of the output stales Some of the early A/D converters 
uere appropriately called quantizing encoders 

12.2.2 Quantizer Transfer Function 

The non-linear transfer function shown in Figure 12 3 is that of an ideal quantizer 
with eight output stales, with output code words assigned, it is also that of a 
3-bit A/D converter The eight output stales are assigned the sequence of binary 
numbers from 000 through to 111 The analog input range for this quantizer is 
Oto +10V 

There are several important points concerning the transfer function of 
Figure 12.3 First, the resolution of the quantizer is defined as the number of 
output states expressed in bits , in this case it is a 3-bit quantizer The number of 
output states for a binary coded quantizer is 2", where n is the number of bits 
Thus, an 8-bit quantizer has 256 output states and a I2-bit quantizer has 4096 
output states 
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Figure 12.3 Transfer function and error of idea] 3-bit quantizer 

As shown in the diagram, there are 2" — 1 analog decision points (or threshold 
levels) in the transfer function. These points are at voltages of +0.625, + 1.875, 
+3.125, +4.375, +5.625, +6.875, and +8.125 V. The decision points must be 
precisely set in a quantizer in order to divide the analog voltage range into the 
correct quantizer values. 

The voltages +1.25, +2.50, +3.75, +5.00, +6.25, +7.50, and +8.75 V are 
the centre points of each output code word. The analog decision point voltages 
are precisely halfway between the code word centre points. The quantizer 
staircase function is the best approximation which can be made to a straight 
line drawn through the origin and full scale point; notice that the line passes 
through all of the code word centre points. 

12.2.3 Quantizer Resolution and Error 

At any part of the input range of the quantizer, there is a small range of analog 
values within which the same output code word is produced. This small range 
is the voltage difference between any two adjacent decision points and is known 
as the analog quantization size, or quantum, Q. In Figure 12.3, the quantum is 
1.25 V and is found in general by dividing the full scale analog range by the 
number of output states. Thus 


Q = FSR/2' 


( 12 . 1 ) 
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^^he^e FSR is the full scale range, or 10 V m this case Q is the smallest analog 
difference uhich can be resohed, or distinguished, by the quantizer In the case 
of a 12-bit quantizer, the quantum is much smaller and is found to be 


FSR 


10 V 

4m 


= 2.44 mV 


( 12 ^) 


If the quantizer mpm is moied through its entire range of analog I’alues and 
the difference between output and mput is taken, a sawtooth error function 
results, as shown m Figure 123 This function is called the quantizing error and 
IS the irreducible error which results from the quantizmg process It can be 
reduced onlj bj increasmg the number of output states (or the resolution) of the 
quantizer, thereby making the quantization finer 
For a gi\en analog mput ralue to the quantizer, the output error will \-ary 
anywhere from 0 to ±QJ1, the error is zero only at analog values correspondmg 
to the code centre pomts This error is also frequently called quantization un- 
certainty or quantization noise 

The quantizer output can be thought of as the analog mput with quantiza^ 
tion noise added to it The noise has a peak*to*peak \alue of Q but as with 
other types ofnoise, the aierages’alue IS zero Ilsrjits. value, bowe\er, is useful 
m analysis and can be computed from the triangular waveshape to be Q/2^3 


123 SA.MPLrsG THEORY 


123 1 Introduction 

An A/D converter requires a small, but significant amount of time to perfonn 
the quantizmg and codmg operations The time required to make the con- 
version depends on several factors the converter resolution, the conversion 
techmque, and the speed of the components employed m the converter The 
coDVfzsyruj speed retjvzred /nr ,2 pzrt/ruJar depends on ibe lane 

variation of the signal to be converted and on the accuracy desired. 


1233 Aperture Time 

Conversion time is frequently referred to as aperture time In general, aperture 
time refers to the time uncertamty (or lime vnndow) in making a measurement 
and results m an amplitude uncertamty (or error) m the measurement if the 
signal IS changmg dunng this time 

As shown m Figure 124, the mput signal to the A/D converter changes by 
AV dimng the aperture time 1 , m w hich the conv ersion is performed. The error 
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Figure 12.4 Aperture time and 
amplitude uncertainty 


can be considered an amplitude error or a time error; the two are related as 
follows: 


AV = t, 


dno 

df 


(12.3) 


where dV(t)/dt is the rate of change with time of the input signal. 

It should be noted that A V represents the maximum error due to signal change, 
since the actual error depends on how the conversion is done. At some point 
in time within the signal amplitude corresponds exactly with the output code 
word produced. 

For the specific case of a sinusoidal input signal, the maximum rate of change 
occurs at the zero crossing of the waveform, and the amplitude error is 


AV = tg — (/4 sin cot),-Q = t^Aco 
dt 


(12.4) 


The resultant error as a fraction of the peak-to-peak full scale value is 

e > = nft. (12.5) 

From this result the aperture time required to digitize a 1 kHz signal to 10 
bits resolution can be found. The resolution required is one part in or 
0 . 001 . 


e 0.001 
Irf ~ 3.14 X 10^ 


= 320 X 10“ ® 


( 12 . 6 ) 


The result is a required aperture time of just 320 ns ! One should appreciate the 
fact that 1 kHz is not a particularly fast signal, yet it is difficult to fed a 10 bit 
A/D converter to perform this conversion at any price! Fortunately, there is 
a relatively simple and inexpensive way around this dilemma by using a sample- 
hold circuit. 
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12JJ Sample-holds and Aperture Error 

A sample hold circuit samples thesignal \oItage and then stores it on a capaator 
for the time-required to perform the A/D comersion The aperture time of the 
A/D con\erter is, therefore, greatly reduced by the much shorter aperture tune 
of the sample-hold circuit In turn, the aperture tune of the sample-hold is a 
function of its bandwdth and switching time 
Figure 12 5 IS a useful graph of equation {li5) It gi\es the aperture lime 
required for coniertmg sinusoidal signals to a maximum error less than one 
piart in 2* where n is the resolution of the con\erter m bits The peak-to-peak 
salue of the sinusoid is assumed to be the full scale range of the A/D converter 
The graph is most useful m selecting a sample-bold by aperture time or an A/D 
converter bj conversion tune 



SINUSOIDAL FSEOUENCY (Hz] 

Ficure 12.5 Graph for apenure error for sinusoidal signals 
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12.3.4 Sampled-data Systems and the Sampling Theorem 

In data acquisition and distribution systems, and other sampled-data systems, 
analog signals are sampled on a periodic basis as illustrated in Figure 12.6. 
The train of sampling pulses in Figure 12.6b represents a fast-acting switch 
which connects to the analog signal for a very short time and then disconnects 
for the remainder of the samphng period. 

The result of the fast-acting sampler is identical with multiplying the analog 
signal by a train of sampling pulses of unity amplitude, giving the modulated 
pulse train of Figure 12.6c. The amplitude of the original signal is preserved in 
the modulation envelope of the pulses. If the switch-type sampler is replaced by 
a switch and capacitor (a sample-hold circuit), then the amplitude of each sample 
is stored between samples and a reasonable reconstruction of the orginal 
analog signal results, as shown in Figure 12.6d. 

A common use of sampling is in the efficient use of data processing equip- 
ment and data transmission facilities. A single data transmission link, for 
example, can be used to transmit many different analog channels on a sampled, 
time-multiplexed, basis, whereas it would be uneconomical to devote a complete 
transmission link to the continuous transmission of a single signal. 

Likewise, a data acquisition and distribution system is used to measure and 
control the many parameters of a process control system by sampling the 
parameters and updating the control inputs periodically. In data conversion 



Figure 12.6 Signal sampling; (a) signal; (b) sampling 
pulses; (c) sampled signal; (d) sampled and held signal 
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systems it is common to use a single, expensive A/D converter of high speed and 
precision and then multiplex a number of analog inputs into it 
An important fundamental question to answer about sampled-data systems 
IS this ‘How often must I sample an analog signal in order not to lose informa- 
tion from if^’ It IS obvious that all useful information can be extracted if a 
slowly varying signal is sampled at a rate such that little or no change takes 
place between samples Equally obvious is the fact that information is bemg 
lost if there is a significant change m signal amplitude between samples 
The answer to the question is contained in the well known samphng theorem 
which may be stated as follows If a continuous, bandwidth-limited signal con- 
tains no frequency components higher than f^, then the original signal can be 
recoiered without distortion if it is sampled at a rate of at least 2f samples per 
second 


123.5 Frequency Folding and Alia^ng 

The sampling theorem can be demonstrated by the frequency spectra illustrated 
in Figure 127 Figure 127a shows the frequency spectrum of a continuous 
bandwidth-limited analog signal with frequency components out to/* When this 
signal IS sampled at a rate /, , the modulation process shifts the original spectrum 
out to/,, 2/„ 3/,, , in addition to the one at the origin A portion of this 

resultant spectrum is shown m Figure 12 7b 
If the sampling frequency /, is not high enough, part of the spectrum centred 
about/ will fold over into the original signal spectrum This undesirable effect 
IS called frequency folding In the process of recovering the original signal, the 



Figure 12 7 Frequency spectra demonstrating the samp} 
mg theorem (a) contmuous signal spectrum, (b) sampled 
signal spectrum 
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rate 

folded part of the spectrum causes distortion in the recovered signal which 
cannot be eliminated by filtering the recovered signal. 

From the figure, if the sampling rate is increased such that /^ — f^> 
then the two spectra are separated and the original signal can be recovered 
without distortion. This demonstrates the results of the sampling theorem that 
/s > 2/,. Frequency folding can be eliminated in two ways; first by using a high 
enough sampling rate, and second by filtering the signal before sampling to 
limit its bandwidth iofjl. 

It must be appreciated that in practice there is always some frequency folding 
present due to high-frequency signal components, noise, and non-ideal pre- 
sample filtering. The effect must be reduced to negligible amounts for the 
particular application by using a sufficiently high sampling rate. The required 
rate, in fact, may be much higher than the minimum indicated by the sampling 
theorem. 

The effect of an inadequate sampling rate on a sinusoid is illustrated in 
Figure 12.8; an alias frequency in the recovered signal results. In this case, 
sampling at a rate slightly less than twice per cycle gives the low-frequency 
sinusoid shown by the dotted line in the recovered signal. This alias frequency 
can be significantly different from the original frequency. From the figure it is 
easy to see that if the sinusoid is sampled at least twice per cycle, as required by 
the sampling theorem, the original frequency is preserved. 

12.4 CODING FOR DATA CONVERTERS 
12.4.1 Natural Binary Code 

A/D and D/A converters interface with digital systems by means of an appro- 
priate digital code. While there are many possible codes to select, a few standard 
ones are almost exclusively used with data converters. The most popular code 
is natural binary, or straight binary, which is used in its fractional form to repre- 
sent a number 


N — 0^2 -t- ^2^ ^ "F <^32 ^ -t" • • • -f" o„2 


(12.7) 
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where each coefficient a assumes a \alueofzero or one N has a value between 
zero and one 

A binary fraction is normally written as 0 IIOIOI, but with data con\erter 
codes the deamal point is omitted and the code word is written IIOIOI This 
code w ord represents a fraction of the full scale \ alue of the con\ erter and has 
no other numerical significance 

The bmary code word 110101, therefore, represents the decimal fraction 
(1 X 05) + (I X 025) + (0 X 0125) + (I x 00625) + (0 x 003125) 

+ (1 X 0015625) = 0 828125 or 828125% 

of full scale for the con\erter If full scale is + 10 V, then the code word repre- 
sents +8 28125 V The natural bmary code belongs to a class of codes known as 
positice weighted codes since each coefficient has a speafic weight, none of 
which is negatne 

The leftmost bit has the most weight, 0 5 of full scale, and is called the most 
Significant bit (MSB) , the nghtmost bit has the least w eight, 2~" of full scale, and 
IS, therefore, called the least significant bit (LSB) The bits in a code word are 
numbered Crom left to right from 1 to n. 

The LSB has the same analog equnalent value as Q discussed previously, 
namely 


LSB (analog value) « FSR/2' (12 8) 

Table 12 1 ts a useful summary of the resolution, number of states, LSB weights, 
and dynamic range for data converters from one to twenty bits resolution 
The dynamic range (DR) of a data converter m deabels (dB) is found as 
follows 


DR = 20 log 2" = 20n log 2 
= 20fi(0301) = 602n 


(12-9) 


Kirenr DR is djnamrc range, n ts (he number of brCs, anJ 2" (he number cf 
states of the conv erter Since 6 02 dB corresponds to a factor of tw o, it is simply 
necessary to multiply the resolution of a converter m bits by 601 A 12-bit 
conv erter, for example, has a dynamic range of 7Z2 dB 
An important pomt to notice is that the maximum value of the digital code, 
namely all Ts, does not correspond with analog full scale, but rather with one 
LSB less than full scale, or FS(1 — 2“") Therefore a 12-bit converter with a 
Oto +10 Vanalogrange has a maximum code of nil 1111 1111 andamaxi- 
mum analog value of + 10 V(1 — 2 **) = +9 99756 V In other words, the 
maximum analog value of the converter, correspondmg to all ones in the code, 
nev er quite reaches the pomt defined as analog full scale 
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Table 12.1 Resolution, number of states, LSB weight, and dynamic range for data 

converters 


Resolution 
bits n 

Number of states 

2" 

LSB Weight 

2-n 

Dynamic range 
(dB) 

0 

1 

1 

0.0 

1 

2 

0.5 

6.0 

2 

4 

0.25 

12.0 

3 

8 

0.125 

18.1 

4 

16 

0.0625 

24.1 

5 

32 

0.03125 

30.1 

6 

64 

0.015625 

36.1 

7 

128 

0.0078125 

42.1 

8 

256 

0.00390625 

48.2 

9 

512 

0.001953125 

54.2 

10 

1 024 

0.0009765625 

60.2 

11 

2 048 

0.00048828125 

66.2 

12 

4 096 

0.000244140625 

72.2 

13 

8 192 

0.0001220703125 

78.3 

14 

16 384 

0.00006103515625 

84.3 

15 

32 768 

0.000030517578125 

90.3 

16 

65 536 

0.0000152587890625 

96.3 

17 

131 072 

0.00000762939453125 

102.3 

18 

262 144 

0.000003814697265625 

108.4 

19 

524 288 

0.0000019073486328125 

114.4 

20 

1 048 576 

0.00000095367431640625 

120.4 


12.4.2 Other Binary Codes 

Several other binary codes are used with A/D and D/A converters in addition 
to straight binary. These codes are offset binary, two's complement, binary coded 
decimal (BCD), and their complemented versions. Each code has a specific 
advantage in certain applications. BCD coding for example is used where 
digital displays must be interfaced such as in digital panel meters and digital 
multimeters. Two’s complement coding is used for computer arithmetic logic 
operations, and offset binary coding is used with bipolar analog measures. 

Not only are the digital codes standardized with data converters, but so 
also are the analog voltage ranges. Most converters use unipolar voltage ranges 
of 0 to + 5 V and 0 to + 10 V although some devices use the negative ranges 0 to 
- 5 V and 0 to — 10 V. The standard bipolar voltage ranges are + 2.5 V, + 5 V 
and + 10 V. Many converters today are pin-programmable between these 
various ranges. 

Table 12.2 shows straight binary and complementary binary codes for a 
unipolar 8-bit converter with a 0 to -f 10 V analog FS range. The maximum 
analog value of the converter is -f 9.961 V, or one LSB less than + 10 V. Note 
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Table 12-2 Binarj coding for 8 bit ympolar comerters 


Fraction ofFS 

-H0\ FS 

Straight binary 

Complementary 

binary 

-irFS- ILSB 

-1-996! 

nil nil 

0000 0000 

+ iFS 

+ 7500 

11000000 

0011 nil 

+ IFS 

+ 5000 

10000000 

oni nil 

+ 1FS 

+ 2500 

01000000 

1011 nil 

+ iFS 

+ I 250 

0010 0000 

iioi nil 

-1-1 LSB 

+0039 

00000001 

nil nio 

0 

0000 

00000000 

nil nil 


that the LSB size is 0 039 V as shonti near the bottom of the table The comple- 
meniary binary coding used in some converters is simply the logic complement 
of straight binary 

When A/D and D/A converters are used m bipolar operation, the analog 
range is offset by half scale, or by the MSB value The result is an analog shift 
of the converter transfer function as shown in Figure 129 Notice for this 3*bit 
A/D converter transfer function that the code 000 conesponds with —5 V, 100 
with 0 V, and 1 1 1 with + 3 75 V Since the output codmg is the same as before 
the analog shift, it is now appropnately called offset binary coding 

Table 113 shows the offset binary code together with complementary offset 
binary, rwo's complement, and stgmmagnitude binary codes These are the most 
popular codes employed in bipolar data converters 



Figure 12 9 Transfer function for bipolar 3 bit 
A/D converter 
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Table 12.3 Popular bipolar codes used with data converters 


Fraction of FS 

±5 VFS 

Offset 

binary 

Comp. off. 
binary 

Two’s 

complement 

Sign-mag. 

binary 

+FS - 1 LSB 

+ 4.9976 

nil nil 

0000 0000 

0111 nil 

nil nil 

+|FS 

+ 3.7500 

1110 0000 

0001 nil 

0110 0000 

1110 0000 

-f|FS 

+2.5000 

1100 0000 

0011 nil 

0100 0000 

1100 0000 

+iFS 

+ 1.2500 

1010 0000 

0101 nil 

0010 0000 

1010 0000 

0 

0.0000 

1000 0000 

0111 nil 

0000 0000 

1000 0000* 

-iFS 

-1.2500 

0110 0000 

1001 nil 

1110 0000 

0010 0000 

-iFS 

-2.5000 

0100 0000 

1011 nil 

1100 0000 

0100 0000 

-JFS 

-3.7500 

0010 0000 

1101 nil 

1010 0000 

0110 0000 

-FS + 1 LSB 

-4.9976 

0000 0001 

nil 1110 

1000 0001 

0111 nil 

-FS 

-5.0000 

0000 0000 

nil nil 

1000 0000 

— 


* Sign magnitude binary has two code words for zero as shown here: 

0+ 1000 0000 0000 
0 - 0000 0000 0000 


The two’s complement code has the characteristic that the sum of the positive 
and negative codes for the same analog magnitude always produces all zeros and 
a carry. This characteristic makes the two’s complement code useful in arithmetic 
computations. Notice that the only difference between two’s complement and 
offset binary is the complementing of the MSB. In bipolar coding, the MSB 
becomes the sign bit. 

The sign-magnitude binary code, infrequently used, has identical code words 
for equal magnitude analog values except that the sign bit is different. As shown 
in Table 12.3 this code has two possible code words for zero : 1000 0000 or 
0000 0000. The two are usually distinguished as 0+ and 0 — , respectively. 
Because of this characteristic, the code has maximum analog values of 
+ (FS — ILSB) and reaches neither analog +FS nor — FS. 


12.4.3 BCD Codes 

Table 12.4 shows BCD and complementary BCD coding for a three-decimal 
digit data converter. These are the codes used with integrating type A/D 
converters employed in digital panel meters, digital multimeters, and other 
decimal display applications. Here four bits are used to represent each decimal 
digit. BCD is a positive weighted code but is relatively inefScient since in each 
group of four bits, only 10 out of a possible 16 states are utilized. 

The LSB analog value (or quantum, Q) for BCD is 


LSB(analog value) = Q = FSR/10‘' 


( 12 . 10 ) 
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Table 12 4 BCD and complcmenlAry BCD coding 


Fraction of FS 

+ I0VFS 

Binary coded decimal 

Complementary BCD 

+ FS - 1 LSB 

+999 

1001 1001 tool 

oiioono 0110 

+ IFS 

+ 7 50 

011101010000 

1000 10101111 

+ 4FS 

+ 500 

010100000000 

10101111 nil 

+ IFS 

+ 2 50 

001001010000 

1101 10101111 

+ sFS 

+ 125 

0001 0010 0101 

1110 1101 1010 

+ 1 LSB 

+001 

000000000001 

nil nil Mio 

0 

000 

000000000000 

nil nil nil 


where FSR is the full scale range and d is the number of decimal digits For 
example if there are three digits and the full scale range is 10 V, the LSB value is 

LSBianahg value) * JO V/JO’ ^001V= WmV (12 1 1) 

BCD coding is frequently used with an additional overrangc bit which has a 
weight equal to full scale and produces a 100% increase in range for the A/D 
converter Thus for a converter with a decimal full scale of 999, an overrange bit 
provides a new full scale of 1999, twice that of the previous one In this case, 
the maximum output code is 1 1001 1001 1001 The additional range is com- 
monly referred to as J digit, and the resolution of the A/D com erter m this case 
IS 3 - i digits 

Likewise, if this range is again expanded by 100%, a new full scale of 3999 
results and is called 3-1 digits resolution Here two overrange bits have been 
added and the full scale output code is 1 1 1001 1001 1001 When BCD coding 
IS used for bipolar measurements another bit a sign bit, is added to the code and 
the result is sign magnitude BCD coding 


12 S AMPLIFIERS AND FILTERS 
12^ 1 Operational and Instrumentation Ampliliers 

The front end of a data acquisition system extracts the desired analog signal from 
a physical parameter by means of a transducer and then amplifies and filters it 
An amplifier and filter are critical components m this initial signal processing 
The amplifier must perform one or more of the following functions boost the 
signal amplitude, buffer the signal, convert a signal current into a voltage, or 
extract a differential signal from common mode noise 
To accomplish these functions requires a vanety of different amplifier types 
The most popular type of amplifier is an operational amplifier (op amp ) which 
IS a general purpose gam block with differential inputs The op amp may be 
connected in many different closed loop configurations, of which a few are 
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Figure 12.10 Operational amplifier configuration!^ 


shown in Figure 12.10. The gain and bandwidth of the circuits shown depend 
on the external resistors connected around the amplifier. An operational 
amplifier is a good choice in general where a single-ended signal is to be amplified, 
buffered, or converted from current to voltage. 

In the case of differential signal processing, the instrumentation amplifier 
is a better choice since it maintains high impedance at both of its differential 
inputs and the gain is set by a resistor located elsewhere in the amplifier circuit. 
One type of instrumentation amplifier circuit is shown in Figure 12.11. Notice 



Figure 12.11 Simplified instrumentation amplifier circuit 
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lhai no gain-setting resistors are connected to cither of the input terminals. 
Instnimentalion amplifiers hate the following important characteristics 

(a) high impedance differenlial inputs, 

(b) low input offset \oltagc dnft, 

(c) low input bias currents, 

(d) gam casil) set b> means of one or two externa! resistors, 

(c) high common-mode rejection ratio 

Common-mode Rejection 

Common mode rejection ratio is an important parameter of differential amplifiers 
An ideal difTcrcniial input amplifier responds only to the \oltagc difference 
between its input terminals and docs not respond at all to any toltage that ts 
common to both input terminals (common mode \oltagc) In non-ideal 
amplifiers howc\er, the common-mode input signal causes some output 
response e%cn though small compared to the response to a differential input 
signal 

The ratio of differential and common mode responses is defined as the 
common mode rejection ratio (CMRR) Common-mode rejection ratio of an 
amplifier is the ratio of differential voltage gam to common'mode voltage gam and 
IS generally expressed in dB 

CMRR = 20 Iog,o(/la//lcM) (12 12) 

where is differential \oItage gam and /lot ts common-mode \o!tage gam 
CMRR IS a function of frequency and, therefore, also a function of the im- 
pedance balance between the two amplifier input terminals At c\cn moderate 
frequencies CMRR can be significantly degraded by small unbalances in the 
source senes resistance and shunt capacitance 


12,53 Other Amplifier Tj-pcs 

There arc se\cral other special amplifiers which arc useful in conditioning the 
input signal in a data acquisition s)’stem 
An isolation amplifier is used to amplify a differenlial signal which is super- 
imposed on a Ncry high common-mode voltage, perhaps se\eral hundred or 
e\-en seicral thousand \oIts The isolation amplifier has the characteristics of an 
instrumentation amplifier with a \cry high common-mode input voltage 
capabilit) Another special amplifier, the chopper stabilized amplifier, is used to 
accurately amplify microvolt level signals to the required amplitude This 
amplifier craplo)'5 a special sw itching stabilizer which gives extremely low input 
offset voltage dnft Another useful device, the electrometer amplifier, has 
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ultra-low input bias currents, generally less than one picoampere and is used to 
convert extremely small signal currents into a high-level voltage. 

Morrison (1977) treats the subject of d.c. amplifiers. 

12,5.4 Filters 

A low-pass filter frequently follows the signal processing amplifier to reduce 
signal noise. Low-pass filters are used for the following reasons; to reduce 
man-made electrical interference noise, to reduce electronic noise, and to limit 
the bandwidth of the analog signal to less than half the sampling frequency in 
order to eliminate frequency folding. When used for the last reason, the filter 
is called a pre-sampling filter or anti-aliasing filter. 

Man-made electrical noise is generally periodic, as for example in power 
line interference, and is sometimes reduced by means of a special filter such as a 
notch filter. Electronic noise, on the other hand, is random noise with noise 
power proportional to bandwidth and is present in transducer resistances, 
circuit resistances, and in amplifiers themselves. It is reduced by limiting the 
bandwidth of the system to the minimum required to pass desired signal 
components (refer to Chapter 11 for more detail). 

No filter does a perfect job of eliminating noise or other undesirable frequency 
components, and therefore the choice of a filter is always a compromise. Ideal 
filters, frequently used as analysis examples, have flat passband response with 
infinite attenuation at the cut-off frequency, but are mathematical filters only 
and not physically realizable. 

In practice, the systems engineer has a choice of cut-off frequency and attenua- 
tion rate. The attenuation rate and resultant phase response depend on the 



NORMALIZED FREQUENCY 


Figure 12.12 Some practical low-pass filter characteristics 
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particular filter characteristic and the number of poles m the filter function 
Some of the more popular filter characteristics include Butterworth, Chebyshev, 
Bessel, and elliptic In making this choice, the effect of overshoot and non- 
uniform phase delay must be carefully considered Figure 12 12 illustrates 
some practical low-pass filter response characteristics Their design is the 
subject of Chapters 9 and 10 

Passive RLC filters are seldom used in signal processing applications today 
due chiefly to the undesirable characteristics of inductors Active filters are 
generally used now since they permit the filter characteristics to be accurately 
set by precision, stable, resistors and capacitors Inductors, with their un- 
desirable saturation and tempera turedrift characteristics, are thereby eliminated 
Also, because active filters use operational amplifiers, the problems of insertion 
loss and output loading are also eliminated 

12 6 SETTLING TIME 


12 6 1 Definition 

A parameter that is specified frequently in data acquisition and distribution 
systems is settling time The term settling time originates in control theory but 
IS now commonly applied to amplifiers, multiplexers, and D/A converters 

Settling time is defined as the time elapsed from the application of a full scale 
step input to a circuit to the time when the output has entered and remained within 
a specified error band around its final value The method of application of the 
input step may vary depending on the types of circuit, but the definition still 
holds In the case of a D/A converter, for example, the step is applied by changing 
the digital input code whereas in the case of an amplifier the input signal itself is a 
step change 

The importance of settling lime in a data acquisition system is that certain 
analog operations must be performed in sequence and one operation may have 
to be accuratefy settfed before the next operation can be initiated Thus a buffer 
amplifier preceding an A/D converter must have accurately settled before the 
conversion can be initiated 

Settling time for an amplifier is illustrated in Figure 12 13 After application 
of a full scale step input there is a small delay time following which the amplifier 
output slews, or changes at its maximum rate Slew rate is determined by internal 
amplifier currents which must charge internal capacitances 

As the amplifier output approaches final value, it may first overshoot and 
then reverse and undershoot this value before finally entering and remaining 
within the specified error band Note that settling time is measured to the point 
at which the amplifier output enters and remains within the error band This 
error band in most devices is specified to either ±0 1 % or ±001% of the full 
scale transition 
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12.6.2 Amplifier Characteristics 

Settling time, unfortunately, is not readily predictable from other amplifier 
parameters such as bandwidth, slew rate, or overload recovery time, although 
it depends on all of these. It is also dependent on the shape of the amplifier open 
loop gain characteristic, its input and output capacitance and the dielectric 
absorption of any internal capacitances. An amplifier must be specifically 
designed for optimized settling time, and settling time is a parameter that must 
be determined by testing. 

One of the important requirements of a fast-settling amplifier is that it have 
a single-pole open-loop gain characteristic, that is, one that has a smooth 6 dB 
per octave gain roll-off characteristic to beyond the unity gain crossover fre- 
quency— a first-order response. Such a desirable characteristic is shown in 
Figure 12.14. 

It is important to note that an amplifier with a single-pole response can never 
settle faster than the time indicated by the number of closed-loop time constants 



Figure 12.14 Amplifier single-pole open-loop 
eain characteristic 
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to the gjven accuracy Figure 12 15 shows output error as a fuDCtjon o/ the 
number of time constants t where 

T = 1/w = l/2n/ (12 13) 

and / IS the closed loop 3 dB bandwidth of the amplifier 

Actual settling tune for a good quality amplifier may be significantly longer 
than that indicated by the number of closed-loop time constants due to slew 
rate limitation and o\erload recovery tune For example, an amplifier with a 
closed-loop bandwidth of 1 MHz has a time constant of 160 ns which indicates 
a settling time of 1 44 /is (nine time constants) to 0 01 % of final value If the slew 
rate of this amplifier is I V/ps, it will take more than 10 ps to settle to 0 01 % 
for a 10 V change 

If the amphfier has a non-umform gam roll-off charactenstic rather than a 
single-pole charactenstic, then its setthog time may have one of two unde- 
sirable quahties First, the output may reach the vicinity of the error band 
quickly but then take a long time to actually enter it, second, it may o\ershoot 
the error band and then oscillate back and forth through it before finally entermg 
and remaining inside it 

Modem fast-settimg operational amplifiers come in many different types 
mcluding modular, hybrid, and monolithic amplifiers Such amplifiers have 
setthng times to 0 1 % or 0 01 % of 2 ps down to 100 ns and are useful in many 
data acquisition and conversion appbcations 
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12.7 DIGITAL-TO-ANALOG CONVERTERS 


12,7.1 Introduction 

D/A converters are the devices by which computers communicate with the 
outside analog world. They are employed in a variety of applications from CRT 
display systems and voice synthesizers to automatic test systems, digitally 
controlled attenuators, and process control actuators. In addition, they are 
key components inside most A/D converters. D/A converters are also referred 
to as DAC’s and are termed decoders by communications engineers. 

The transfer function of an ideal 3-bit D/A converter is shown in Figure 12.16. 
Each input code word produces a single, discrete analog output value, generally, 
but not always, a voltage. Over the output range of the converter 2" different 
values are produced including zero; and the output has a one-to-one cor- 
respondence with input, which is not true for A/D converters. 



Figure 12.16 Transfer function of ideal 3-bit D/A 
converter 


There are many different circuit techniques used to implement D/A con- 
verters. but a few popular ones are widely' used today. Virtually all D/A con- 
verters in use are of the parallel type where all bits change simultaneously 
upon application of an input code word ; serial type D/A converters, on the other 
hand, produce an analog output only after receiving all digital input data in 
sequential form. 
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12.7.2 Weighted Current Source D/A Converter 

The most popular D/A converter design in use today is the weighted current 
source circuit illustrated in Figure 12 17 An array of switched transistor current 
sources is used with binary weighted currents The binary weighting is achieved 
by using emitter resistors with binary related values of 1?, 2R, AR, 8i?, 2"R 

The resulting collector currents are then added together at the current summing 
line 

The current sources are switched on or offfrom standard TTL semi-conductor 
device inputs by means of the control diodes connected to each emitter When 
the TTL input is high the current source is on, when the input is low it is off, 
with the current flowing through the control diode Fast switching speed is 
achieved because there is direct control of the transistor current, and the 
current sources ne\er go into saturation 

To interface with standard TTL levels, the current sources are biased to a 
base voltage of + 1 2 V The emitter currents are regulated to constant values 
by means of the control amplifier and a precision voltage reference circuit 
together with a binary transistor 

The summed output currents from all current sources that are on go to an 
operational amplifier summing junction, the amplifler converts this output 
current into an output voltage In some D/A converters the output current is 


m INPUT DATA 


+ Vs 12 3 



Figure 12 17 Weighted current source D/A converter 
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used to directly drive a resistor load for maximum speed, but the positive output 
voltage in this case is limited to about + 1 V. 

The weighted current source design has the advantages of simplicity and 
high speed. Both PNP and NPN transistor current sources can be used with 
this technique although the TTL interfacing is more difficult with NPN sources. 
This technique is used in most monolithic, hybrid, and modular D/A converters 
in use today. 

A difficulty in implementing the higher resolution D/A converter designs to 
this concept is that a wide range of emitter resistors is required and very high 
value resistors cause problems with both temperature stability and switching 
speed. To overcome these problems, weighted current sources are used in identi- 
cal groups, with the output of each group divided down by a resistor divider 
as shown in Figure 12.18. 

The resistor network, Ri through J? 4 , divides the output of group 3 down by a 
factor of 256 and the output of group 2 down by a factor of 16 with respect to 
the output of group 1. Each group is identical, with four current sources of the 
type shown in Figure 12.17, having binary current weights of 1, 2, 4, 8. Figure 
12.18 also illustrates the method of achieving a bipolar output by deriving an 
offset current from the reference circuit which is then subtracted from the out- 
put current line through resistor R^. This current is set to exactly one half the 
full scale output current. 



Figure 12.18 Current dividing the outputs of weighted current source 

groups 

12.7.3 R-2R D/A Converter 

A second popular technique for D/A conversion is the R-2R ladder method. 
As shown in Figure 12.19, the network consists of series resistors of value R 
and shunt resistors of values 2R. The bottom of each shunt resistor has a single- 
pole double-throw electronic switch which connects the resistor to either ground 
or the output current summing line. As in the previous circuit, the output 
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current summing line goes to an operational amplifier which converts current 
to \oItage 

The operation of the R-2R ladder network is based on the binary division of 
current as it flows down the ladder Examination of the ladder configuration 
reveals that at point A looking to the right, one measures a resistance of 2R, 
therefore the reference input to the ladder has a resistance of R At the reference 
input the current splits into two equal parts since it sees equal resistances in 
either direction Likewise, the current flowing down the ladder to the right 
continues to divide into two equal parts at each resistor junction 
The result is binary weighted currents flowing down each shunt resistor in the 
ladder The digitally controlled switches direct the currents to either the sum- 
ming line or ground Assuming all bits are on, as shown in the diagram, the 
output current is 

which is a binary series The sum of all currents is then 

/... =^(l-2-) (1215) 

where the 2~* term physically represents the portion of the input current 
flowing through the 2R terminating resistor to ground at the far right 
The advantage of the /?-2K ladder technique is that only two values of re- 
sistors are required, with the resultant case of matching or trimming and 
excellent temperature tracking In addition, for high speed applications relatively 
low resistor values can be used Excellent results can be obtained for high 
resolution D/A converters by using laser-trunmed thin film resistor networks 
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12.7.4 Multiplying and Deglitched D/A Converters 

The R~2R ladder method is specifically used for multiplying type D/A con- 
verters. With these converters, the reference voltage can be varied over the full 
range of + T^ax with the output being the product of the reference voltage and 
the digital input word. Multiplication can be performed in 1, 2, or 4 algebraic 
quadrants. 

If the reference voltage is unipolar, the circuit is a one-quadrant multiplying 
DAC; if it is bipolar the circuit is a two-quadrant multiplying DAC. For four- 
quadrant operation the two current summing lines shown in Figure 12.19 
must be subtracted from each other by operational amplifiers. 

In multiplying D/A converters, the electronic switches are usually imple- 
mented with CMOS devices. Multiplying DAC’s are commonly used in auto- 
matic gain controls, CRT character generation, complex function generators, 
digital attenuators, and divider circuits. Figure 12.20 shows two 14-bit multi- 
plying CMOS D/A converters. 

Another important D/A converter design takes advantage of the best features 
of both the weighted current source technique and the R-2R ladder technique. 
This circuit, shown in Figure 12.21, uses equal value switched current sources 
to drive the junctions of the R-2R ladder network. The advantage of the equal 
value current sources is obvious since all emitter resistors are identical and 
switching speeds are also identical. This technique is used in many ultra-high 
speed D/A converters. 

One other specialized type D/A converter used primarily in CRT display 
systems is the deglitched D/A converter. All D/A converters produce output 
spikes, or glitches, which are most serious at the major output transitions of^FS, 
jFS and |FS as illustrated in Figure 12.22a. 

Glitches are caused by small time differences between some current sources 
turning off and others turning on. Take, for example, the major code transition 
at half scale from 0111 ••• 1111 to 1000 • • • 0000. Here the MSB current source 
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Figure 12.20 CMOS 14-bit multiplying D/A converters 
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turns on while all other current sources turn off. The small difference in switching 
times results in a narrow half-scale glitch. Such a glitch produces distorted 
characters on CRT displays. 

Glitches can be virtually eliminated by the circuit shown in Figure 12.22b. 
The digital input to a D/A converter is controlled by an input register while the 
converter output goes to a specially designed sample-hold circuit. When the 
digital input is updated by the register, the sample-hold is switched into the hold 
mode. After the D/A converter has changed to its new output value and all 
glitches have settled out, the sample-hold is then switched back into the tracking 
mode. When this happens, the output changes smoothly from its previous 
value to the new value with no glitches present. Figure 12.23 shows a modular 
deglitched D/A converter which contains the circuitry just described. 

12.8 VOLTAGE REFERENCE CIRCUITS 

An important circuit required in both A/D and D/A converters is the voltage 
reference. The accuracy and stability of a data converter ultimately depends upon 
the reference; it must, therefore, produce a constant output voltage over 
both time and temperature. 

The compensated Zener reference diode with a buffer-stabilizer circuit is 
commonly used in most data converters today. Although the compensated 
zener may be one of several types, the compensated subsurface, or buried, 
Zener is probably the best choice. These relatively new devices produce an 
avalanche breakdown which occurs beneath the surface of the silicon, resulting 
in better long-term stability and noise characteristics than with earlier surface 
breakdown Zeners. These reference devices have reverse breakdown voltages 
of about 6.4 volts and consist of a forward biased diode in series with the 
reversed biased Zener. Because the diodes have approximately equal and 
opposite voltage changes with temperature, the result is a temperature stable 
voltage. Available devices have temperature coefficients from 100 ppm/°C 
to less than 1 ppm/°C. 

Some of the new IC voltage references incorporate active circuitry to buffer 
the device and reduce its dynamic impedance ; in addition, some contain tempera- 
ture regulation circuitry on the chip to achieve ultra-low temperature coefficient 
(tempco). 

A popular buffered reference circuit is shown in Figure 12.24; this circuit 
produces an output voltage higher than the reference voltage. It also generates a 
constant, regulated current through the reference which is determined by the 
three resistors. 

Some monolithic A/D and D/A converters use another type of reference 
device known as the band-gap reference. This circuit is based on the principle of 
using the known, predictable base-to-emitter voltage of a transistor to generate 
a constant voltage equal to the extrapolated band-gap voltage of silicon. This 
reference gives excellent results for the lower reference voltages of 1.2 or 2.5 V. 



518 


HANDBOOK OF MEASURE.MEVT SCIENCE 



Figure 12 24 A precision buffered voltage reference circuit 
12.9 ANALOG-TO-DIGITAL CONVERTERS 


12.9.1 Counter-tjpe A/D Converter 

A/D converters, also called ADCs or encoders, employ a variety of different 
circuit techniques to implement the conversion function As with D/A con- 
verters, however, relatively few of the many ongmally devised arcuits are 
widely used today Of the various techniques available, the choice depends on 
the resolution and speed required 

One of the simplest A/D converters is the counter, or smt), type This circuit 
employs a digital counter to control the input of a D/A converter Clock pulses 
are applied to the counter and the output of the D/A is stepped up one LSB at a 
time A comparator compares the D/A output with the analog input and stops the 
clock pulses when they are equal The counter output is then the converted 
digital word 

While this converter is simple, it is also relatively slow An improvement on 
this technique is shown in Figure 12 25 and is known as a tracking A/D con- 
verter, a device commonly used m control systems Here an up-down counter 
controls the DAC, and the clock pulses are directed to the pertinent counter 
input depending on whether the D/A output must increase or decrease to reach 
the analog input voltage 

The obvious advantage of the tracking A/D converter is that it can con- 
tinuously follow the input signal and give updated digital output data if the 
signal does not change too rapidly Also, for small input changes, the conversions 
can be quite fast The converter can be operated in either the track or hold 
modes by a digital input control 

12.9.2 Successive-approximation A/D Converters 

By far, the most popular A/D conversion technique in general use for moderate 
to high-speed applications is the successiie-approximation type A/D This 
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digital 

OUTPUT 

DATA 


method falls into a class of techniques known as feedback type A/D converters 
fo " cou^ type also belongs. In both cases a 

feedback loop of a digital control circuit which changes ds output unti it 
equals the analog input. In the case of the successive-approximation converter, 
DAC is Sled in an opilmum manner ,o complele a eonvers.on rn jnsl 
>1 stcns where ii is the resolution of the converter in bits. 

The'opcration of this converter is analogous to weighing “ 

laboratoVy balance scale using standard weights tn a binary 3““ “j; 

(, J, 1/1, kilograms. The correct procedure is to begin with the largest 

Standard weight and proceed in order down to the sma lest one^ 

The larcest weight is placed on the balance pan first; if it does not tip, 
wJght is « on L the'^next largest weight is added. If the ba ance does tip 
the weight is removed and the next one added. The 
for the next largest weight and so on down to the smallcs . c 
weight has been tried and a decision made, the weighing is ^h" ° 

of the standard weights remaining on the balance is the c oses p 

mation to the unknown. ricirrv n 

In the successive-approximation A/D converter ^ ® 

successive-approximation register (SAR) controls the / conver MSB 

menting the weighing logic just described. The SAR first turns on the MSB 
of the DAC and the comparator tests this output against the ^ B ' 

A decision is made by the comparator to leave the bit on or turn 1 o ‘ 

bit 2 is turned on and a second comparison made. After n compa 
digital output of the SAR indicates all those bits which remain on and 
the desired digital code. The clock circuit controls the timing of the b . 
Figure 12.27 shows the D/A converter output during a typical conversion. 
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Figure 12 26 Successne approximation A/D converter 



Figure 12 27 D/A output for 8 bit successive approximation 
conversion 

The conversion efficiency of this technique means that high resolution con- 
\ ersions can be made in very short times For example, it is possible to perform 
a 10-bit conversion m 1 /ts or less Of course the speed of the internal circuitry, 
in particular the D/A and comparator, are cntical for high-speed performance 

12.9J The Parallel (Flash) A/D Converter 

For ultra -fast conversions required in video signal processing and radar 
applications where up to 8 bits resolution is required, a different technique is 
employed , it is known as the parallel {also flash, or simultaneous) method and is 
illustrated in Figure 12 28 This circuitry employs 2" — 1 analog comparators 
to directly implement the quantizer transfer function of an A/D converter 
The comparator tnp-pomts are spaced I LSB apart by the series resistor 
chain and voltage reference For a given analog input voltage all comparators 
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Figure 12.28 4-bit parallel A/D converter 


biased below the voltage turn on and all those biased above it remain off. Since 
all comparators change state simultaneously, the quantization process is a 
one-step operation. 

A second step is required, however, since the logic output of the comparators 
is not in binary form. Therefore an ultra-fast decoder circuit is employed to 
make the logic conversion to binary. The parallel technique reaches the ultimate 
in high speed because only two sequential operations are required to make the 
conversion. 



BIT 1 2 3 4 5 6 7 8 

OUTPUT DATA 


Figure 12.29 Two-stage parallel 8-bit A/D converter 
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The limitation of the method, however, is m the large number of comparators 
required for e% en moderate resolutions A 4-bit converter, for example, requires 
only 15 comparators, but an 8-bil converter needs 255 For this reason it is 
common practice to implement an 8-bit A/D with two 4-bit stages as shown in 
Figure 1229 

The result of the first 4 bit conversion is converted back to analog by means 
of an ultra-fast 4-bit D/A and then subtracted from the analog input The 
resulting residue is then converted by the second 4-bit A/D and the two sets of 
data are accumulated m the 8 bit output register Converters of this type achieve 
8-bit conversions at rates of 20 MHz and higher, while single-stage 4-bit con 
versions can reach 50 to 100 MHz rates 

12.10 INTEG RATING-TYPE A/D CONVERTERS 


12.101 Indirect A/D Comersion 

Another class of A/D converters, known as integrating type, operates by an 
indirect conversion method The unknown input voltage is converted into a 
time period which is then measured by a clock and counter A number of 
variations exist on the basic principle such as single^slope, duaUslope, and 
triple slope methods In addition there is another technique— completely 
different— which is known as the charge^balancing or quantized feedback 
method 

The most popular of these methods are dual-slope and charge-balancing, 
although both are slow, they have excellent Imeanty characteristics with the 
capability of rejectmg input noise Because of these characteristics, mtegrating- 
type A/D converters are almost exclusnely used m digital panel meters, digital 
multimeters, and other slow measurement apphcations 


12.10.2 Dual-slope A/D Coniersion 

The dual-slope technique, shown in Figure 12 30, is perhaps the best known 
Comersion begms when the unknown input voltage is switched to the inte- 
grator mput, at the same time the counter begms to count clock pulses and 
counts up to overflow At this point the control circuit switches the mtegrator 
to the negative reference voltage which is integrated until the output is back to 
zero Clock pulses are counted dunng this time until the comparator detects the 
zero crossing and turns them off 

The counter output is then the converted digital word Figure 12 31 shows the 
mtegrator output waveform where T| is a fixed time and T 2 is a time proportional 
to the input voltage The times are related as follows 

r, = T.EJK,, (12 16) 
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DIGITAL OUTPUT 


Figure 12.30 Dual-slope A/D converter 


The digital output word, therefore, represents the ratio of the input voltage to the 
reference. 

Dual-slope conversion has several important features. First, conversion 
accuracy is independent of the stability of the clock and integrating capacitor 
so long as they are constant during the conversion period. Accuracy depends only 
on the reference accuracy and the integrator circuit linearity. Second, the 
periodic noise rejection of the converter can be infinite if Ti is set to equal the 
period of the noise. To reject 60 Hz power noise, therefore, requires that Tj 
be 16.667 ms. 



Figure 12.31 Integrator output waveform for dual-slope 
A/D converter 
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12.10J Charge-balancing A/D Con\ersion 

The charge-balancing, or quantized feedback, method of conversion is based 
on the principle of generating a pulse tram with frequency proportional to the 
input voltage and then counting the pulses for a fixed penod of time This 
arcuit is shown in Figure 12.32 Except for the counter and timer, the circuit 
IS a loltage-to-frequency (V/F) converter which generates an output pulse rate 
proportional to input voltage 

The circuit operates as follows A positive input voltage causes a current to 
flow into the operational integrator through This current is integrated, 
producing a negative going ramp at the output Each time the ramp crosses zero 
the comparator output triggers a precision pulse generator which puts out a 
constant width pulse 

The pulse output controls switch S] which connects R 2 to the negative 
reference for the duration of the pulse Durmg this time a pulse of current 
flows out of the integrator summing junction, producing a fast, positive ramp 
at the integrator output This process is repeated, generating a tram of current 
pulses which exactly balances the input current— hence the name charge 
balancing This balance has the following relationship 



where t is the pulse width and / the frequency 
A higher input voltage, therefore, causes the integrator to ramp up and down 
faster, producing higher frequency output pulses The timer circuit sets a fixed 
time penod for counting Like the dual-slope converter, the circuit also inte- 
grates input noise, and if the timer is synchronized with the noise frequency, 
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Figure 12 32 Charge balancing A/D converter 
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Figure 12.33 Noise rejection for integrating-type A/D 
converters 

infinite rejection results. Figure 12.33 shows the noise rejection characteristic 
of all integrating type A/D converters with rejection plotted against the ratio of 
integration period to noise period. 

12,11 ANALOG MULTIPLEXERS 
12.11.1 Analog Multiplexer Operation 

Analog multiplexers are the circuits that time-share an A/D converter among a 
number of different analog channels. Since the A/D converter in many cases is 
the most expensive component in a data acquisition system, multiplexing 
analog inputs to the A/D is an economical approach. Usually the analog multi- 
plexer operates into a sample-hold circuit which holds the required analog 
voltage long enough for A/D conversion. 

As shown in Figure 12.34 an analog multiplexer consists of an array of parallel 
electronic switches connected to a common output line. Only one switch is 
turned on at a time. Popular switch configurations include 4, 8, and 16 channels 
which are connected in single (single-ended) or dual (differential) configurations. 

The multiplexer also contains a decoder-driver circuit which decodes a 
binary input word and turns on the appropriate switch. This circuit interfaces 
with standard TTL inputs and drives the multiplexer switches with the proper 
control voltages. For the 8-channel analog multiplexer shown, a one-of-eight 
decoder circuit is used. 

Most analog multiplexers today employ the CMOS switch circuits shown 
in Figure 12.35. A CMOS driver controls the gates of parallel-connected P- 
channel and N-channel MOSFET’s. Both switches turn on together with the 
parallel connection giving relatively uniform on-resistance over the required 
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Figure 12 35 CMOS analog switch circuit 


analog input voltage range The resulting on resistance may vary from about 
50 ft to 2 kft depending on the multiplexer, this resistance increases with 
temperature 

12.11.2 Analog Multiplexer Characteristics 

Because of the senes resistance, it is common practice to operate an analog 
multiplexer into a \ery high load resistance such as the input of a unity gam 
buffer amplifier shown in the diagram The load impedance must be large 
compared with the switch on resistance and any senes source resistance in 
order to maintain high transfer accuracy Transfer error is the input to output 
error of the multiplexer with the source and load connected , error is expressed 
as a per cent of input voltage 
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Transfer errors of 0.1 % to 0.01 % or less are required in most data acquisition 
systems. This is readily achieved by using operational amplifier buffers with 
typical input impedances from 10® to 10‘^ fi. Many sample-hold circuits also 
have very high input impedances. 

Another important characteristic of analog multiplexers is break-before-make 
switching. There is a small time delay between disconnection from the previous 
channel and connection to the next channel which assures that two adjacent 
input channels are never instantaneously connected together. 

Settling time is another important specification for analog multiplexers; 
it is the same definition previously given for amplifiers except that it is measured 
from the time the channel is switched on. Throughput rate is the highest rate at 
which a multiplexer can switch from channel to channel with the output settling 
to its specified accuracy. Crosstalk is the ratio of output voltage to input voltage 
with all channels connected in parallel and off; it is generally expressed as an 
input to output attenuation ratio in decibels. 

As shown in the representative equivalent circuit of Figure 12.36, analog 
multiplexer switches have a number of leakage currents and capacitances 
associated with their operation. These parameters are specified on data sheets 
and must be considered in the operation of the devices. Leakage currents, 
generally in picoamperes at room temperature, become troublesome only at 
high temperatures. Capacitances affect crosstalk and settling time of the 
multiplexer. 

12.11.3 Analog Multiplexer Applications 

Analog multiplexers are employed in two basic types of operation; low-level 
and high-level. In 'high-level multiplexing, the most popular type, the analog 
signal is amplified to the 1 to 10 V range ahead of the multiplexer. This has the 
advantage of reducing the effects of noise on the signal during the remaining 
analog processing. In low-level multiplexing the signal is amplified after multi- 
plexing; therefore, great care must be exercised in handling the low-level signal 



Figure 12.36 Equivalent circuit of analog multiplexer 
switch 
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Figure 12 37 Flying capacitor multiplexer switch 


Up to the multiplexer Low-level multiplexers generally use two-wire differential 
switches m order to minimize noise pick up Reed relays, because of essentially 
zero senes resistance and absence of switching spikes, are frequently employed 
in low-le\el multiplexing systems They are also useful for high common-mode 
voltages 

A useful specialized analog multiplexer is the fiying^capacitor type This circuit 
shown as a single channel in Figure 1237 has differential inputs and is par- 
ticularly useful with high common-mode voltages The capacitor connects first 
to the differential analog input, charging up to the input voltage, and is then 
switched to the differential output which goes to a high input impedance instru- 
mentation amplifier The differential signal is, therefore, transferred to the 
amplifier input without the common mode voltage and is then further processed 
up to A/D conversion 

In order to realize large numbers of multiplexed channels, it is possible to 
connect analog multiplexers in parallel using (he enable input to control each 
device This is called single-leiel multiplexing The output of several multi- 
plexers can also be connected to the inputs of another to expand the number of 
channels, this method is double level multiplexing 

12 12 SAMPLE-HOLD CIRCUITS 


12.12.1 Operation of Sample-holds 

Sample-hold circuits, discussed earlier, are the devices which store analog 
information and reduce the aperture time of an A/D converter A sampJe-hoId 
IS simply a voltage-memory device in which an input voltage is acquired and then 
stored on a high quahty capacitor A popular circuit is shown m Figure 12 38 
Ai IS an input buffer amplifier with a high input impedance so that the source, 
which may be an analog multiplexer, is not loaded The output of Ai must be 
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Figure 12.38 Popular sample-hold circuit 


capable of driving the hold capacitor with stability and enough drive current 
to charge it rapidly. Si is an electronic switch, generally a FET, which is rapidly 
switched on or off by a driver circuit which interfaces with TTL inputs. 

C is a capacitor with low leakage and low dielectric absorption characteristics ; 
it is a polystyrene, polycarbonate, polypropylene, or Teflon type. In the case of 
hybrid sample-holds, the MOS type capacitor is frequently used. 

A 2 is the output amplifier which buffers the voltage on the hold capacitor. 
It must, therefore, have extremely low input bias current, and for this reason a 
FET input amplifier is required. 

There are two modes of operation for a sample-hold : sample or tracking mode, 
when the switch is closed; and hold mode, when the switch is open. Sample-holds 
are usually operated in one of two basic ways. The device can continuously 
track the input signal and be switched into the hold mode only at certain specified 
times, spending most of the time in tracking mode. This is the case for a sample- 
hold employed as a deglitcher at the output of a D/A converter, for example. 

Alternatively, the device can stay in the hold mode most of the time and go 
to the sample mode just to acquire a new input signal level. This is the case for a 
sample-hold used in a data acquisition system following the multiplexer. 

12.12.2 The Sample-hold as a Data Recovery Filter 

A common application for sample-hold circuits is in data recovery, or signal 
reconstruction, filters. The problem is to reconstruct a train of analog samples 
into the original signal; when used as a recovery filter, the sample-hold is 
known as a zero-order hold. It is a useful filter because it fills in the space between 
samples, providing data smoothing. 

As with other filter circuits, the gain and phase components of the transfer 
function are of interest. By an analysis based on the impulse response of a 
sample-hold and use of the Laplace transform, the transfer function is found to be 

/s Jl/Z/s 


(12.18) 
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Figure 12 39 Gam and phase components of zero-order 
hold transfer function 


where/, is the sampling frequeacy This function contains the famibar (sin x)/x 
term plus a phase term, both of which are plotted m Figure 1 2 39 
The sample-hold is, therefore, a low-pass filler with a cut-off frequency 
slightly less thauy^ and a bnear phase which results m a constant delay time 
of T/2, where T is the time between samples Notice that the gam function also 
has sigmficant response lobes beyond /, For this reason a sample hold re- 
construction filter is frequently followed by another conventional low-pass 
filter 


12.123 Other Sample-hold Circuits 

In addition to the basic circuit of Figure 12 38, there are several other sample- 
hold circuit configurations which are frequently used Figure 12 40 shows two 
such circuits which are closed-loop circuits as contrasted with the open-loop 
arcmtcf Figure i2 38 Figure f24(Jausesanoperationafin(egra(orandanof6er 
amplifier to make a fast accurate inverting sample hold A buffer amplifier is 
sometimes added m front of this circiut to give high input impedance Figure 
12 40b shows a high input impedance, non-mvertmg sample-hold circuit 
The circuit in Figure 12 38, although generally not as accurate as those in 
Figure 12 40, can be used with a diode-bndge switch to realize ultra-fast acquisi 
tion sample-holds 
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Figure 12.40 Accurate closed-loop sample-hold circuits; (a) in- 
verting; (b) non-inverting 


12.12.4 Sample-hold Characteristics 

A number of parameters are important in characterizing sample-hold per- 
formance. Probably the most important of these is acquisition time. The defini- 
tion is similar to that of settling time for an amplifier. It is the time required, after 
the sample-command is given, for the hold capacitor to charge to a full-scale 
voltage change arid remain within a specified error band around final value. 

Several hold-mode specifications are also important. Hold-mode droop is the 
output voltage change per unit time when the sample switch is open. This droop 
is caused by the leakage currents of the capacitor and switch, and the output 
amplifier bias current. Hold-mode feedthrough is the percentage of input signal 
transferred to the output when the sample switch is open. It is measured with a 
sinusoidal input signal and caused by capacitive coupling. 

The most critical phase of sample-hold operation is the transition from the 
sample mode to the hold mode. Several important parameters characterize 
this transition. Sample-to-hold offset or step error is the change in output voltage 
from the sample mode to the hold mode, with a constant input voltage. It is 
caused by the switch transferring charge onto the hold capacitor as it turns off. 

Aperture delay is the time elapsed from the hold command to when the 
switch actually opens; it is generally much less than a microsecond. Aperture 
uncertainty or aperture jitter is the time variation, from sample-to-sample, of 
the aperture delay. It is the limit on how precise is the point in time of opening 
the switch. Aperture uncertainty is the time used to determine the aperture error 
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due to rate ofchanse of the input signal Sc\eral of the abosc specifications are 
illustrated m the diagram of Figure 12.41 

Sample hold circuits are simple m concqjt, but generall> difficult to full} 
understand and applj Their operation is full of subllelies. and the) must, 
therefore, be carefall\ selected and then tested m a gl^•en appheatjoa. 

12,13 SPECmCATION OF DATA CONAERTERS 
12.13 1 Ideal Terras Real Data Conserters 

Real and D/A converters do not hav’etbe ideal transfer functions discussed 

earher There are three basic departures from the ideal offset, gam, and Itnearti) 
errors. These errors arc all preienl at the same lime in a converter, m addition, 
ihev change with both time and temperature. 

Figure 12.42 shows A/D converter trauifer functions which illustrate the 
three error tvpes. Figure 12.42a shows error, the analog error b) which the 
transfer function fails to pass through zero Next, m Figure 12.42b is gam error, 
aliO called scale factor error, it is the difference m slope between the actual 
transfer function and the ideal expressed as a per cent of analog magmtude. 

InRgure 12.42c /infant} CTTor,ornoii-Imeant), is showTi,lhis is here defined 
as the maaimum deviation of the actual transfer function from the ideal straight 
Ime at an) pomt along the function. It is expressed as a per cent of full scale or in 
LSB size, such a* ^ILSB, and assumes that offiet and gnin errors have been 
adjusted to aero 

Most A/D and DA converters available todav have provision for external 
trimming of oHiet and gam errors. B> careful adjustment these tw o errors can be 
reduced to zero, at least at ambient temperature. Lmeant) error, on the other 
hand, is the remaining error that cannot be adjusted out and is an inherent 
characteristic of the converter 
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Ic) 

Figure 1 2.42 (a) OiTset, (b) gain, and (c) linearity errors 


12.13.2 Data Converter Error Characteristics 

Basically there are only two ways to reduce linearity error in a given application. 
First, a better quality, higher cost converter with smaller linearity error can be 
procured. Second, a computer or microprocessor can be programmea 
perform error correction on the converter. Both alternatives may e expensive 
in terms of hardware or software cost. 

The linearity error discussed above is actually more precisely termed Integra 
linearity error. Another important type of linearity error is known as ijjeren la 
linearity error. This is defined as the maximum amount of deviation ot any 
quantum (or LSB change) in the entire transfer function from its i ea size o 
FSR/2". Figure 12.43 shows that the actual quan^m size may be larger or 
smaller than the ideal; for example, a converter with a maximum i eren la 
linearity of LSB can have a quantum size between i LSB and Ij Lbb any- 
where in its transfer function. In other words, any given analog step size is 
(1 ± {) LSB. Integral and differential linearities can be thought of as macro- 

and microlinearities, respectively. i * ^ t 

Two other important data converter characteristics are close y re a e o 
differential linearity specification. The first is monotonicity, whic appies o 
D/A converters. Monotonicity is the characteristic whereby the ° 

circuit is a continuously increasing function of the input. Figure 1 . as ows a 
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Figure 1243 Defining differential linearity error 


nonmonofonic D/A con\erier output where at one point, the output decreases as 
the input mcreases A D/A converter may go nonmonotonic if its differential 
linearity error exceeds 1 LSB, if it is aluays less than 1 LSB, it assures that the 
device will be monotonic 

The term missing code, or skipped code, applies to A/D converters If the 
differential Uneanty error ofan A/D converter exceeds 1 LSB, its output can miss 
a code as shown in Figure 12 44b On the other hand, if the differential linearity 
error is always less than 1 LSB, this assures that the converter will not miss any 
codes Missing codes are the result of the A/D converter's mtemal D/A con- 
verter becoming nonmonotonic 

For A/D converters the character of the lineanty error depends on the lech- 
tuque of conversion Figure 1145a, for example, shows the hneanty character- 
istic of an mtegratmg type A/D converter The transfer function exhibits a 
smooth curv ature betw een zero and full scale The predominant type of error is 
integral hneanty error, while differential hneanty error is virtually nonexistent 

Figure 1245b, on the other hand, shows the hneanty characteristic of a 
sucxessive approximation A/D converter, in this case differential hneanty error 



Figure 1244 Important D'A converter characteristics (a) Nonmonotonic 
D/A converter, (b) A/D converter with missing code 
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Figure 12.45 Linearity Characteristics of (a) integrating and (b) successive- 
approximation A/D converters 


is the predominant type, and the largest errors occur at the specific transitions at 

i i and I scale. This result is caused by the internal D/A converter non-line y, 

tile weight of the MSB and bit-2 current sources is critical in relation to all th 
other weighted current sources in order to achieve ±2 LSB maximum if- 
ferential linearity error. 


12.13.3 Temperature Effects 

Ambient temperature change influences the offset, gain, and linearity errors of a 
data converter. These changes over temperature are normally specified in 
parts per million (ppm) of full scale range per degree Celsius. When operating 
a converter over significant temperature change, the effect on accuracy must 
carefully determined. Of key importance is whether the device remains mo 
tonic, or has no missing codes, over the temperatures of concern, n man 
the total error change must be computed, that is, the sum o o se , ga 

linearity errors due to temperature. . --prp- 

The characteristic of monotonicity, or no missing codes, over a given e 
ture change can be readily computed from the differential linearity 
specified for a data converter. Assuming the converter initia y ^ ^ 

differential linearity error, the change in temperature for an increase o 
is given by 

(12.19) 


AT = 


2"" X 10® 
0 r»T T 


where n is the converter resolution in bits and DLT is the speci e 1 
linearity tempco in ppm of FSR/°C. AT is the maximum change in am 1 
temperature which assures that the converter will remain monotonic, o 
no missing codes. 
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12.14 SELECTION OF DATA CONVERTERS 

It IS necessary to consider a number of important factors in seJectmg A/D or 
D/A converters An organized approach to selection suggests drawing up a 
checklist of required characteristics An initial checklist should include the 
following key items 

(a) converter type, 

(b) resolution, 

(c) speed, 

(d) temperature coefficient 

After the choice has been narrowed by these considerations, a number of 
other parameters must be considered Among these are analog signal range, 
type of codmg, input impedance, power supply requirements, digital interface 
required, linearity error, output current dnve, type of start and status signals for 
an A/D, power supply rejection, size, and weight These parameters should be 
listed m order of importance to effiaenlly organize the selection process 
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Figure 12 46 Standard operating temperature ranges for data converters 


In addition, the required operating temperature range must be determined, 
data com erters are normally specified for one of three basic ranges known m the 
industry as commercial, irufustria/, or mthtary These temperature ranges are 
illustrated in Figure 12 46 Further, the level of reliability must be determined 
m terms of a standard device, a specially screened device, or a military standard 
device 

Pubhshed works providing detail include Hoeschele (1968), Schmid (1970), 
Sheingold (1972), and Zuch (1979) 
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Transmission of Data 


Editorial introduction 

Transferring information in electrical signal form over considerable distances began with 
telegraphic systems It expanded over time into an extremely sophisticated methodology 
Theoretical understanding was progressively improved providing a systematic and rigorous 
basis for implementing transmission systems capable of sending great quantities of data 
at selectably low error rates The original digital signal methods of telegraphy were aug- 
mented by the analog signal methods of telephony and, later, radio communication 

Telemetry of measurement data followed on to meet scientific and specialized industrial 
demands For the period until around 1960 analog systems of data transmission were 
more generally used From then on, however, the impact of low cost, high reliability, 
digital electronic systems became evident as methods of data transfer used in the ever 
expanding size of digital data processing systems found their way into instrumentation 
systems in general 

This chapter comprises two parts The first and major part is concerned with data 
transfer by digital means over extensive networks It is compiled from the viewpoint of an 
extensive computer based communications network this being the most suitable viewpoint 
to present at the time of compilation when such methods are finding their way more and 
more into such areas as process plants, monitoring systems, and stand-alone instrument 
system modules 

The second provides an outline of more specific signal transmission practice Analog 
methods, whilst still having a part to play, will undoubtedly decline in popularity as their 
inherent cost and relatively poor security of message transfer become greater penalties 
compared with digital alternatives 

13.1 INTRODUCTION 
13.1.1 Data Communication 

‘The fundamental problem of communication is that of reproducing 
at one point either exactly or approximately a message transmitted 
from another point.’ C. E. Shannon 
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Data communication has only come into use on a large scale m the past ten 
years Initially data terminals were used to input data traffic into large central 
main frame computers The connections were made either on direct circuits or 
dial up circuits through the use of the standard telephone exchange network 
Advances in computer technology reduced the cost of computers, which were 
then provided in regional centres There was an increase in the development of 
data base systems where customers had direct access to records in computers 
which could be updated and inspected using mainly visual display units (VDU) 
Methods initially developed for national teletechmque networks were gradually 
adopted for other applications such as process plant control and electrical 
supply network control 


13 12 International Standards for Data Communication 

A number of international bodies are concerned with the setting of stand irds 
for all types of communications The mam body particularly concerned with 
operations m this held is the International Telegraph and Telephone Consulta* 
live Committee (CCITT) which cooperates with the International Standards 
Organization (ISO) and the International Electro technical Commission 
(lEC) according to rules defined in Recommendation A20 of the CCITT 
The CCITT has published many relevant works (CCITT, 1964 1972, 1977a 
b, c, d) 

The CCITT has a responsibility for those aspects of data transmission which 
involve telecommunications networks or affect the performance of these 
networks There arc other topics such as standardization of the junction 
(interface) between the standardized modems and data terminal equipment 
which require agreement between CCITT and ISO Standards also have 
to be set for operating speeds of telegraph services and signalling and 
line control procedures for international working of the Telex network Many 
other bodies such as the Conference of European Postal and Telecommuni 
cations Administration (CEPT) and the International Federation of In- 
formation Processing (IFIP) contribute to the development of the international 
standards in the data field which are the primary responsibilities of CCITT 
and ISO 

Existing telecommunications networks have evolved until recent years 
without taking into account the requirement for data transmission which has 
developed rapidly Existing data communications facilities, therefore, fall short 
of the ideal and specialized networks designed specifically for data communi- 
cations are now being developed in several countries Standards for modulation 
rates and interfaces for networks specialized for data are now being developed 
by CCITT and ISO 
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13.2 TERMINAL EQUIPMENT AND CIRCUIT REQUIREMENTS 

13.2.1 Summary 

The transmission requirements of data communication circuits will depend on 
the needs of the terminal equipment. This section will give a brief summary of 
the signalling techniques and the types of terminal equipment provided. 

13.2.2 Data Signalling Principles 

Most telegraph machines, computers and other data processing devices operate 
with digital binary logic information which is transmitted in a coded sequence 
of events. Each element in the sequence has two possible states which are usually 
referred to as 1 or 0. In telegraph transmission the terms mark or space are used. 
The data terminal equipment presents these two binary states to a data trans- 
mission line as a direct current signal either as positive or negative potential, 
called double current signalling, or as the presence or lack of a potential, called 
single current signalling. 

It is the function of the data communication circuit to convert these logic 
signals into a more suitable medium for use over communication circuits and 
to convert them back to logic signals with minimum distortion at the distant 
terminal. 

Signalling rates 

The element of binary coded data is known as a bit (binary digit) which has a 
binary value of 0 or 1. Most simple communication systems, for example a 
telegraph machine, transmit this binary data in a serial binary form (one bit at a 
time) and the data communication rate can be expressed in bits per second 
{bitjs). 

Data codes 

The sequence of binary coded data is sent in the forms of characters usually 
either five or seven bits long which normally conform to CCITT standards. 

13.2.3 Asynchronous or Synchronous Transmission 

Having connected the digital signal generating machine to the communications 
line it is necessary to coordinate the data received with the data sent. 

With asynchronous transmission the characters are sent to line with a variable 
duration between characters (e.g. the random operation of the keyboard of a 
telegraph machine) in this case the receiving machine must know when a 
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character is about to be received so the appropriate timing and sampling mech- 
anisms can be initiated With asynchronous transmission each character is 
preceded by a start element This element will have the same duration as the 
normal signalling elements and would be opposite to the at rest or steady state 
hne condition The five bit or eight-bil character is then followed by 1, 1 5 or 2 
stop elements which are the same stale or polarity as the at rest condition The 
stop elements allow the receiving terminal time to process or print the character 
and set the receiver unit ready to receive the next character Any discrepancies 
in speed between the transmitter and receiver are taken up during the stop 
element pause 

Asynchronous transmission is used for low speed data transmission up to 
about 1200 bit/s 

With synchronous transmission each character immediately follows the 
previous character without any start or stop elements and all the bits in the 
character are of equal duration With this type of transmission the receiver must 
be kept in step with the transmitter and it is also necessary to know when 
transmission is about to commence so the separation between characters in the 
continuous bit stream can be recognized 

This type of transmission is used between computers and the more intelligent 
types of data terminals and operates at speeds greater than 600 bit/s Blocks of 
data from 200 to 1000 characters m length are usually transmitted at a tune, 
however, it is necessary to use special synchronizmg characters at the start of 
each block so the receiving terminal can obtain character as well as bit syn- 
chronization 

13.2 4 Telegraph Distortion 

Consider the signal m Figure 13 I The characteristics of the transmission hne 
will cause the detected signal at the receiving termmal to be distorted and the 



Figure 13 1 Distortion and subsequent detection of 
an example digital signal sequence 
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timing of the start and the end of the pulses will vary from the ideal. The receiving 
mechanism will sample the pulses at an optimum point near the centre of the 
pulse, however, the sampling period will have a finite width of 1-3 ms. 

If the total period of the pulse is 100% then the receiver should tolerate a 
variation of ±50% less than the ideal sampling period. A mechanical machine 
should tolerate ± 45 % distortion. Electronic equipment, with a shorter sampling 
time, could tolerate up to ±49% distortion. 

13.2.5 Data Circuit Terminology 

The following is a summary of the various types of data circuits and circuit 
configurations used : 

One way circuit. Previously known as half Duplex, the transmission is in one 
direction only with no return path. 

Either way circuit. Previously known as Simplex, transmission can occur in 
either direction but in only one direction at a time (equivalent to a telephone 
conversation). 

Both way circuit. Previously known as Duplex, transmission can occur inde- 
pendently in either direction at the same time. 

Backward channel control. Used on some data circuits when a lower speed 
channel, usually of 75 to 110 bit/s rate, is used on the return path to the sending 
station for supervisory and control purposes. 

Multistation circuits. A number of stations can be connected to a single one way 
circuit so they all receive the traffic from the sending station. 

Selective calling units (SCU). Devices to enable the outstation (on multistation 
circuits) to select and print a message with its own header. 

Polled working. Both way circuits can be used on multistations, however the 
outstation cannot send messages until it receives its own identifying code. 
The control station polls each outstation in turn and so controls the incoming 
traffic. 

Link prbcedures. On high speed circuits between computers or sophisticated 
terminals synchronous transmission is used with all messages occurring in 
blocks of fixed length. Each message block (or group of blocks) is acknowledged 
on the return path and so no traffic can be lost. 

Concentrators. On switching networks, where a large number of low traffic 
terminals are in one area, the circuits are combined into one, or more, higher 
traffic circuit to the main exchange or message switching centre. Concentrators 
only connect circuits through or forward messages on to the main centre and do 
not perform any local switching. 
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13^ 6 Design of Data Networks 

Manj design considerations should be taken mto account when selecting the 

optimum circuit configuration. When desienmg a network the designer should 

consider 

(a) W^at are the acceptable delaj-s in deli\ermg data messages or setlmg up 
connections'^ 

(b) How man} outlets and miels are required, are multistauon facihties or 
pollmg facilities required'^ 

(c) Are messages sent to more than one ter min al and are message or circmt 
switchmg facilities required^ 

(d) If workmg m a aimputer mterrogation mode what is the maximum 
permissible response time*^ 

(e) ^Tiat IS the acceptable error rate^ 

(0 ^^^lat message secunt} proceduresarerequircdandwhatmessagehandlmg 
procedures should be adopted 

(g) ^'ith Une switching s>'stems, w hat problems will occur jf excessii e calls are 
made to bus} outlets'’ 



Figure 13.2 Delnerj time and quantit} of data of tjpica] commumcations 
aremts (from Martin (1972a) reprinted bj permission of Prentice Hall 
Inc., Enslewood QilL, NJ) 
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(h) With message switching systems what is the maximum holding time of sent 
messages and how much message storage is required? 

(i) What emergency back up facilities are available in the case of component 
or circuit failures? 

(j) Are accounting and statistical facilities required? 

(k) What are the network control station requirements such as traffic statistics, 
circuit control, alarms, message retrieval? 

Figure 13.2 shows the delivery times and quantity of data for typical communica- 
tion circuits. 


13.3 GENERAL PRINCIPLES OF DATA TRANSMISSION 


13.3.1 Introduction 

Communication circuits were originally designed solely for voice communi- 
cation and much of the design and development work in data transmission is 
based on the principle of designing data terminals and data translation and 
supervisory equipment to make the best use of the circuits and bandwidth that 
is available. 

Voice frequency communication, because of the high level of redundancy, can 
tolerate frequency distortion, noise, and short interruptions on the circuit while 
still retaining a high level of credibility. 

Telegraph transmission of plain spoken language has a certain amount of 
redundancy and a few individual characters can be incorrectly received without 
any appreciable change to the information in the message. Any errors in a data 
message, if not in plain language, could cause problems and faulty operation 
of the terminal equipment. 

The basic data communication circuit is as shown in Figure 13.3. The trans- 
mitter is the input device which converts the internal information in the digital 
machine to a two-state, binary, sequential code. 

The converter (for example a data modem) converts this code into some 
other coded state that is more suitable for transmission over the communication 
circuit. Examples are 

(a) Voltage changes to convert the binary signals from the transmitter to a 
voltage level more suitable for d.c. transmission over a short physical 
circuit. 

(b) Frequency changes where the transmitter’s binary signals are converted 
either by amplitude, frequency or phase modulation to a frequency spec- 
trum compatible with the normal derived voice frequency operating 
telephone circuit. 
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Noise 

Figure 13 3 Schematic ofdata communication circuit 


The converter at the distant terminal will convert the incoming signal to a senes 
of binary bits acceptable to the receiver which then processes the binary code 
and performs some function in the receiving device The CCITT conventions 
for data transmission are as shown in Table 13 1 
A low speed telegraph circuit at 75 bii/s can transmit information at almost 
the same information rate as a verbal communication The telegraph channel 
will use much less of the frequency $p>ectrum and it is possible to fit 24 telegraph 
circuits into one loice frequency transmission (VFT) circuit However, the 
telegraph circuit is more prone to interference caused by noise and other line 
conditions 


Table 13 I CCITT conveniions for data transmission 



Digit 0 

Start' signal m start stop code 
Line available condition 
in tefex switching 
'Space' elemcntofstart stopcode 
Condition A 

Digit 1 

‘Stop’ signal in start-stop code 
Line idle condition 
in telex switching 

Mark’ element of start-stop code 
Condition Z 

Amplitude 

modulation 

Tone-off 

Tone-on 

Frequency 

modulation 

High frequency 

Low frequency 

Phase modulation 
with reference phase 

Opposite phase to ihe reference 
phase 

Reference phase 

Differential 
two phase 
modulation where 
the aflemalive 
phase changes are 0^ 
or 180° 

No-phase inversion 

Inversion of the phase 

Perforations 

No perforation 

Perforation 
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13.3.2 Intersymbol Interference 

When a series of binary signals or bits is sent via a communication circuit the 
characteristics of the circuit must be such that the receiving terminal can detect 
whether the bit is a 0 or 1 for every bit combination occurring in the signal. The 
signals could be distorted sufficiently such that one binary bit could interfere 
with the following bit. This is known as intersymbol interference (ISI), as 
depicted in Figure 13.4, where a signal is sent through a circuit with a low 
frequency response. 

Data circuits must be designed so they are adequately immune to the problems 
of intersymbol interference. 

13.3.3 Direct Current Signalling 

For short physical circuits direct current signalling can be used. Two basic 
types of digital signals are used for low speed telegraph machines; 

(a) Single current (SC) signals where a continuous current on line is equivalent 
to a mark and no current is a space. The control terminal normally sends 
40 mA for a mark and no current for a space. The signalling distance is 
restricted by the physical resistance of the cable pair. This type of signalling 
is suitable for one-way or either-way transmission. 

(b) Double current (DC) signalling which sends a negative voltage (usually 
— 50 V) to line for a mark and a positive voltage (usually -t- 50 V) for a 
space. The signalling current is 20 m A, This is an earth return type signalling 
and each leg of the pair is used for one direction only so independent 
signals can be sent from each end at the same time. The circuit can be used 
for one-way, either-way, or both-way transmission. 
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Figure 13.4 Distortion of a signal by intersignal interference 
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The signalling distance is restricted by the physical resistance of the cable pair; 
however, the distance can be extended with repeating relays 

13J.4 VFTSjstems 

Low speed telegraphy operates at speeds of 50 and 75 bit/s and only requires a 
bandwidth of approximately 120 Hz to give an acceptable reproduction of the 
input pulse at the receiving terminal It is possible to fit 24 telegraph circuits in 
the bandwidth of a normal voice frequency arcuit The standard system is the 
24-channel frequency-modulated (FM) VFT system Amplitude-modulated 
(AM) systems are no longer used as their performance is markedly inferior to 
the FM alternative 

I3J.5 Baseband Signalling Systems 

A baseband signalling system can be described as a system wherein the signals 
sent to line are a series of binary d c pulses and there is no modulation of a 
sinusoidal signal as for AM and FM systems These systems can operate up to 
at least 2 Mbit/s on cable pairs 

Baseband systems can change the voltage of the signals leaving the data 
terminal and in some cases baseband systems can convert the d c levels from the 
data terminal to a series of pulses These signalling systems are used only in 
physical cables in city areas and not over trunk circuits 
A full description of baseband signalling techniques is give in Section 13 4 

13.3 6 Data Modems 

Like VFT systems data modems convert the binary d c pulses into a signal form 
more suitable for transmission over VF or wider bandwidth circuits Data 
modems can be provided in many versions 

(a) Low speed frequency modulation up to 1200 bit/s These can transmit in 
both directions on a iwo-wire circuit using different frequencies in each 
direction 

(b) Phase modulation can operate from 2400 to 9600 bit/s using four-wire 
circuits 

(c) Baseband modems can send 200 to 48 kbit/s direct d c pulses over local 
cable used in the four-wire Horking mode 

(d) Backward channel signalling where a lower speed, say 75 bit/s, channel is 
added for signal acknowledgement and monitoring on the return path of 
the four-wire circuit in each direction 

(e) Higher speed modems operating at up to 5000 kbit/s over microwave 
links or coaxial cables (Characteristics of the various circuit carriers are 
discussed later in this chapter) 
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13.3.7 Regenerative Repeaters 

As a data signal is transmitted over a number of circuits the detected pulses can 
vary in timing (refer to Figure 13.1). Sometimes the signal is regenerated at some 
mid-point so that the repeated output signal can be given the correct timing 
between pulses. This is done by sampling the signal and transmitting a new 
signal at the time the sampling takes place — usually in the mid-point of the 
incoming pulse. The regenerated signal must always be at least half an element 
behind the incoming signal. 


13.3.8 Time Division Multiplexing 

Multiplexers are used to combine a number of low data speed circuits on to one 
high speed data circuit and so make more efficient use of the data circuits. 

Consider the modem operating at 9600 bit/s in Figure 13.5. The inputs from 
circuits 1, 2, and 3 are combined on to one high speed circuit and the signals for 
each circuit are then separated at the other end. It should be noted that the input 
circuits to the time division multiplexer (TDM) must operate at some sub- 
multiple of the multiplexer’s speed, also the inputs must be in synchronism with 
the multiplexer. For this reason the synchronizing clock in the multiplexer must 
control the incoming circuits and it would be difficult to operate this system with 
asynchronous signalling unless special circuitry is provided. Some elements in 
the multiplexer are required for control purposes. 

13.3.9 Asynchronous Multiplexers 

There are two methods of generating multiplexed signals from asynchronous 
signal inputs. 


Inputs 

Circuits Nol 01011- - - 

Circuits No 2 10011- — - 

Circuits No 3 110 0 1- - - 


Output from multiplexer 

12345123451234512345 etc 


0 

0 

0 

n 

□ 

0 

0 

0 

G 

□ 

0 

0 

0 

n 

n 

0 

0 

0 

□ 

□ 


Bit 1 Bit 2 Bit 3 Bit 4 etc. 

Multiplexer output is 5 times input speed 

Figure 13.5 Time division multiplexing of three digital signal 
circuits 
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Code and speed independent multiplexers 

These rely on sampling the input signal at a rate which is usually five or more 
times the maximum signalling rale In this case the timing error on sampling at 
the input to the multiplexer is one half the sampling period The modulating or 
signalling rate is five times the input rate and there would be a bandwidth 
restriction It should be noted that some of the bandwidth is used for control 
purposes 

Code and speed dependent multiplexers 

When the signalling code and the speed is known the input device on the 
multiplexer can sample and store the signal As soon as the complete character 
is received the signal can be sent into a synchronous multiplexer at the same bit 
rate as the input signal The input and output devices of the multiplexer must 
know the number of elements and the size of the stop and start elements 
Although the system is restricted to fixed signalling speeds many more data 
channels can be accommodated in the same bandwidth It should be noted that 
the signals are regenerated (see Section 13 3) when they pass through the 
multiplexer 

Character storage facilities must be allowed for at the input so that the 
multiplexer can cope with variation m the timing of the incoming characters 

13.4 BASEBAND SIGNALLING 


13.4.1 Introduction 

Baseband digital data transmission is a well established technique for data 
transmission over local circuits 

A baseband digital data signal can be defined as one that does not involve 
some form of modulation of a sinusoidal earner , the absence of the modulating/ 
demodulating circuitry usually results in a data transmitter/receiver of fess 
complexity and cost so baseband data transmission is adopted where possible 
Baseband data transmission can be used on physical cable pairs and coaxial 
cables 

While It is possible to use simple binary polar pulses to transmit the data 
information, several considerations usually make the adoption of some form 
of line coding desirable These are 

(a) Removal of the low frequency components This allows the data signal to 
be transformer-coupled to line to give protection against longitudinal 
voltages and also to permit d c line power feeding, d c y^etting m which a 
boost current is applied to break down insulating contact films, or auxiliary 
signalling 
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(b) Sufficient transmitted information is needed to enable the extraction of a 
clock synchronizing signal at the receiver. 

(c) The introduction of redundancy in the data to allow monitoring of the 
transmission performance and/or the transmission of line control informa- 
tion to be carried out. 

(d) The available bandwidth. If shorter pulses are used in the coding a greater 
bandwidth is required. 

(e) The shape of the signal spectrum to allow for minimum intersymbol 
interference. 

(f) The average power transmitted to line and the line losses. 

(g) The need for a receiver to be insensitive to polarity or line reversals. 


13.4.2 Line Coding 

Many different types of line codes are proposed for the transmission of data 
signals. Some of these types are shown in Figure 13.6. The advantages and 
disadvantages of each can be summarized as: 


(1) Non-return to zero (NRZ) binary 

This is the most common method for low speed telegraph transmission. Its 
advantages are that it uses simple technology and is suitable for asynchronous 
and synchronous operation. The disadvantages are the existence of a d.c. 
component, clock synchronizing problems, and that the signal is polarity 
sensitive. 


(2) Differentially coded NRZ binary 

A transition occurs for a 1 and no transition for a 0. Signalling characteristics 
are the same as for the NRZ binary system. 

The advantage of this method is that the receiver is not polarity sensitive. 
Disadvantages are the existence of a d.c. component and clock synchronizing 
problems. 


(3) Alternate mark inversion {AMI) 

A pulse of half the bit width is sent for each 1 bit and each pulse is inverted in 
relationship to the previous pulse. 

This has the advantage that a certain amount of error correction is available 
because the receiver can check to see if each pulse is of opposite polarity — less 
power is sent to line as the pulses are shorter. Its disadvantage is again the clock 
synchronizing problem — half-width pulses required greater bandwidth. 
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(4) Full AMI 

This IS similar to AMI, howexer, a full width pulse is used 
A key adxantage is that no d c component exists Furthermore, error detec- 
tion can be used It uses a smaller bandwidth than half-AMI Disadvantages are 
that It IS sensitive to all zero combinations and that more power is sent to line 
than with the AMI method 

(5) Polar return to zero 

Here half-v. idth pulses are used with one polarity for 0 and a different one for 1 
Advantages of this concept are that it is self-clocking, uses simple technology, 
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and requires less power to line than NRZ binary. Disadvantages here are that 
the receiver must be polarity sensitive and a wider bandwidth is required. 

(6) Delay modulation (Miller code) 

A 1 gives a transition in the middle of a bit; a 0 does not give a transition unless 
followed by another 0 then a transition at the end of the bit period. 

Advantages are that there are more transitions, even with a continuous zero, 
without increasing the bandwidth. 


(7) Conditioned diphase (Manchester code) 

Half-width pulses are sent with a polarity reversal for each bit. A 1 bit changes 
the polarity from the preceding bit. This is equivalent to phase modulation 
where a 1 bit changes the phase by 180° relative to the previous phase. 

Advantages are that it is not polarity sensitive and the receiver can start 
sampling at the centre or the end of the bit and still get the same result. Addition- 
ally the receiver is self-clocking, some error detection is available, and as equal 
pulses of both polarities occur no d.c. component exists. 

The disadvantage of this method is that a wider bandwidth is needed. This 
system is often used in storing and reading data from magnetic tape and disc 
systems. As there always occurs a transition for each bit it is easy to synchronize 
the clock on the incoming signals and to compensate for the slight speed 
variations which occur with electromechanical systems. 

(8) CMl coded ?7jark mversion 

A 0 bit always has a transition from negative to positive in the centre of the pulse. 
A 1 bit is always full width and alternate I’s are inverted. 

This has the advantages that the receiver is polarity independent for after a 
few bits are sent the receiver can be trained to know which polarity is meant to 
be negative. This is useful if special control pulses are required during a signal 
transmission. Error recognition is possible. Clocking is also possible but would 
not be as regular as for the conditioned diphase method. No d.c. component 
or d.c. drift problems arise. 

13.4.3 Scrambling 

The problems with d.c. components, clocking, and automatic gain control 
(AGC) when all zero combinations are sent can be alleviated by scrambling 
the message. That is, the transmitted bits are mixed with a continuous fixed 
pattern signal so that there is a continuous combination of O’s and I’s. The 
message is descrambled back to the original message at the receiver. 
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13 4 4 Frequency Spectrum of Baseband Signals 

Referring to Chapter 4 a raised cosine-shaped pulse and the raised cosine shape 
on the filter cut off give the best performance for pulse transmission A number 
of tests ha% e been carried out on some of the line signalling codes mentioned in 
Section 13 4 2 Measurements were made in each case with random character 
combinations and the power spectral curves are as shown m Figure 13 7 
It should be noted that the AM!, both normal and interleaved, gives a fre- 
quency response similar to the raised cosine curve, however, half AMI (which 
also has a curve similar to a raised cosine) sends less power to line but requires 
twice the frequency spectrum The type of signalling system to be used will 
depend on the attenuation of the line and the data rate required 
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Figure 13.8 48 kbit/s baseband modem 


13.4.5 Typical Baseband Modem 

Figure 13.8 shows the layout of a typical baseband modem. The transmitted 
signals are first encoded and scrambled and then passed through a band- 
limiting filter to remove unwanted higher frequency components. 


13.5 PHASE MODULATION 


13.5,1 Introduction 

With normal phase modulation the phase shift of the carrier is modulated by a 
variable analog signal. With digital data transmission we are concerned solely 
with phase shift keying where the carrier phase is not altered more than ± 180° 
at any time and the phase is shifted in definite fixed steps. 

Phase modulation is used at data speeds greater than 2400 bit/s. In the 
simplest version a carrier frequency is shifted 180° from its previous value to 
indicate a change in the state of the bit (0 or 1). 

It is more common to use four phase shifts and a 2400 bit/s signal can be 
generated if a 1200 Hz carrier is modulated with two bits at a time to give four 
combinations of phase shift at 0°, 90°, 180°, and 270°. This can be extended to 
eight combinations of phase shifts in multiples of 45° using three bits. By using 
combined amplitude and phase modulation four bits at a time can be selected 
and data will be transmitted at a bit speed of four times the carrier. For example, 
if a 2400 Hz carrier is used, the data rate will be 9600 bit/s which can be trans- 
mitted over a normal telephone circuit. Table 13.2 shows the phases used for 
various levels of modulation. 
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Table 13 2 Binary and phase values for various PM 
sjstems (a) two-bit modulation, (b) three bit modulation, 
(c) four-bit modulation, combined PM and AM modulation 

(a) 


Bit 1 

Bit 2 

Alternative A 

Alternative B 

0 

0 

ff* 

45° 

1 

0 

-FW 

-1-135° 

1 

1 

-H80’ 

4-225° 

0 

1 

-»-270’ 

•4315° 


(b) 

Three bit values 

Phase change* 

Bitl 

Bit 2 

Bit 3 

0 

0 

I 

0° 

0 

0 

0 

45° 

0 

1 

0 

90* 

0 

1 

1 

135° 

1 

1 

1 

180* 

1 

1 

0 

225° 

1 

0 

0 

270° 

1 

0 

1 

315° 


(c) 

Bit 1 

Bit 2 

Bit 3 

Bit 4 

Phase change* 

Bit 1 = 0 

0 

0 

1 

0° 


0 

0 

0 

45° 

Low fevef 

0 

I 

0 

90° 


0 

1 

1 

135° 

Bit 1 = 1 

1 

1 

1 

180° 


1 

1 

0 

225° 

High level 

1 

0 

0 

270° 


I 

0 

! 

315° 


* The phase change is the actual on-line phast shift in the 
transition region from the end of one signalling element to 
the beginning of the following signalling element 
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The signal at the receiving terminal can be analysed by comparing the received 
signal with an internal fixed reference signal, however, there could be problems 
in maintaining synchronism. To avoid the problem of having to transmit a 
reference carrier phase against which the phase of the signal can be compared, 
the receiving clock is kept in synchronism by the timing of the phase shifts 
between symbols. This form of modulation is often referred to as differentially 
coherent phase shift keying (DCPSK). 

Loss of synchronism can occur if a long string of binary O’s is transmitted 
because the absence of phase changes at the receiving end does not permit the 
recovery of timing information for automatic adjustment of the receiving clock. 

It is worth noting that the probability of error in the detection process depends 
in some degree on the specific assignment of the binary code symbols for the 
various phase shifts. The most probable error is that of interpreting a particular 
phase change as one of the immediately adjacent possible phase changes. The 
coding of the symbols in Table 13.2 was assigned by CCITT so that symbols 
have a minimum of difference to adjacent phase change conditions. The code 
used is known as the Gray code which has only one bit changed for adjacent 
symbols as shown in Table 13.3. 

A phase modulated signal with the instantaneous phase jumps as set out in 
Table 13.2 will have a wide frequency spectrum and it is normal to adopt 
measures to limit the spectrum of the signal. For example, after phase modu- 
lation, filtering may be used; alternatively, the carrier may be amplitude 
modulated and the phase jumps applied when the carrier is a minimum. A 
typical power spectrum for a 1200 baud four-phase modem (2400 bit/s) is shown 
in Figure 13.9. (Transmission rate of pulses is measured in baud, the number of 
pulses per second.) 

Several techniques exist that can be used to demodulate the phase modulated 
signal, but when demodulating for example, a four-phase signal, the basic 
procedure is to divide the circle representing 360° into four quadrants as shown 
in Figure 13.10. The correct phase vectors are situated at the centres of the 
quadrants or decision regions and if the demodulated phase falls inside a given 


Table 13.3 Gray code 


Binary number 

Gray coded number 

000 

000 

001 

001 

010 

on 

on 

010 

100 

no 

101 

111 

no 

101 

111 

100 
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Ffequency (Hil 


Figure 139 Relaiive power spectral density of a 
particular four-phase data modem operating at 2400 
bil/s 

quadrant, the phase vector associated with that quadrant is assigned to be the 
decision of the receiver The added noise and the phase jitter to the signal is 
represented by the vector n(t) 

13.5.2 Inteiference Effects on Phase Modulation 

Normally if the noise causes an error in the detection of the phase shift, the 
incorrectly detected phase shift will be one on either side of the correct one 
Consequently the bit error rale is minimized if the Gray code is used 
A further interesting point (hat emerges from consideration of operation of 
the differentially coherent phase modulation system is that the comparison of 
these two successive phases can be made before or after these phases are quan- 
tized in the receiver If the phases are first quantized and the successive phases 



Figure 13 10 The deasion regions of a 
four-phase data modem 
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subtracted to give the phase change, an error in detection in a given phase will 
affect two successive subtractions and hence two dibits will be in error with 
probably only one bit in error per dibit due to coding of the phase changes (see 
Table 13.2). This means that errors will tend to occur in pairs. However if the 
successive phases of each dibit symbol are first subtracted and then quantitized 
to the changes shown in Table 13.2 then the likelihood of double errors is 
considerably reduced. The added noise can be represented by the vector, n(0, 
as shown in Figure 13.10. As long as the resulting phase vector falls inside the 
quadrant, the correct decision will be made. For Gaussian noise, the probability 
of an incorrect decision for four-phase modulation is plotted in Figure 13.11 as 
a function of the signal-to-noise ratio (SNR). 


13.5.3 Synchronizing and Training Procedures for PM Systems 

Before data can be transmitted over a communication circuit using PM modula- 
tion the following circuit conditions must be established : 

(a) the equalizers in the receiver must be adjusted to the line conditions; 

(fa) the received signalling level must be adjusted ; 

(c) the receiving clocks must be in synchronism with the transmitter; 

(d) the scrambler and descrambler must be in synchronism. 



Figure 13.11 Probability of error (PJ as a 
function of the signal-to-noise ratio (p) of a 
four-phase system: = ^e”'’ 
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The data terminal indicates that it wishes to transmit by sending a request to 
send signal to the modem The modem replies with a ready to send signal to the 
data terminal when all conditions have been met This line-up procedure is done 
automatically in most modems over 4800 bit/s and is known as syiichromzing 
or training procedures 

CCnr recommendations V27 and V29 (CCITT 1977a-d) describe standard 
training modes or turn-on sequences 

13.6 HIGH SPEED OR WIDEBAND DATA TRANSMISSION 

13.6.1 Introduction 

CCITT recommendations V35 and V36 specify standards for wideband data 
systems having bandwidth greater than 3 3 kHz Data transmission at higher 
speeds requires a greater bandwidth Two methods are used 

(a) direct baseband signalling on non-loaded cable pairs using local telephone 
circuits up to 48 kbit/s, 

(b) Modulation of the data to operate in the basic 12-channel VF group 
(60 to 108 kHz) Speeds of up to 168 kbi(/s can be obtained » however, the 
standard rate of 48 kbit/s is more common 

13.6.2 48 kbit/s Baseband System 

The basic layout of a typical 48 kbit/s baseband local circuit is as shown in 
Figure 13 8 The transmitter has five basic elements 

(a) A signal converter to convert the binary series input from the data set to 
signal levels suitable for transmission through the system 

(b) A clock to control the signals from the data set to the modem so they are 
sent at the correct bit rate 

(c) An encoder/scrambler to generate the appropriate line code (Section 
13 4 3) and to eliminate any d c component in the signal chain in at rest 
conditions It also ensures that there are sufficient transitions to provide 
clock synchronism at the receiver terminal 

(d) A wave shaper to set the slope of the rectangular waves to reduce harmonics 
and intersignal interference 

(e) A limiter, output level adjuster, and line interface 

The receiver has six basic elements 

(a) The receiving line amplifier, filter, and line equalizer 

(b) An amplifier with built in AGC to compensate for line variations 

(c) A bit timing detector which uses the incoming transitions of the data signal 
to synchronize the receiving clock 



TRANSMISSION OF DATA 


561 


(d) A signal regenerator where the received signal is regenerated so that it is 
presented to the data set with the correct timing between elements and 
without variation in the timing caused by the line conditions. 

(e) A decoder and descrambler to convert the signal back to the input code. 

(f) A signal converter and data set interface to convert the signal to voltage 
levels acceptable to the data terminal equipment. 

13.6.3 48 kbit/s Vestigal Sideband Transmission Modem 

A 48 kbit/s modem working in the standard 60-108 kHz group bandwidth uses 
vestigal sideband transmission (VSB). In many cases a voice frequency channel is 
included. 

A block schematic of a typical modem is as shown in Figure 13.12. The 48 
kbit/s data signal is amplitude modulated by a 100 kHz carrier to produce a 
36 kHz bandwidth in the range of 64-108 kHz which represents one sideband 
of the modulated carrier. This sideband and the 100 kHz carrier are sent to line. 
It is essential that the 100 kHz carrier be detected at the receiving terminal as this 
will be used to demodulate the sideband: absolute synchronous detection must 
be provided {homochronous demodulation). 

The modem shown in Figure 13.12 allows for a voice frequency channel in 
the 104-8 kHz range which is combined with the data carrier at the transmitter 
hybrid. The signal is separated by band-pass filters and detected at the receiving 
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Figure 13.12 Schematic of 48 kbit/s VSB modem 
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terminal The receiver uses AGC, line equalizers, and automatic phase com- 
pensation where the phase shift of the 100 kHz carrier is adjusted so that it is in 
correct phase relationship with the data sideband The 100 kHz signal is then 
used to demodulate the carrier to produce the 48 kbit/s data 


13 6 4 Other Wideband Data Techniques 

The vestigal sideband system of modulation descnbed m the previous section 
is basically a direct modulation of the baseband signal and the system could also 
be used by wideband analog-type facsimile transmission with a bandwidth of 
about 35 kHz The main problem with this system is that a reference carrier 
must be sent to provide homochronous detection and the reference earner is at 
the upper or loiver end of the wideband spectrum where it is most susceptible 
to the effects of distortion due to group delay 
Two other methods can be used for wideband data transmission These 
methods are particularly suitable for synchronous bit-dependent data trans- 
mission These techniques are as follows 

(a) Split the signal into two sections which are then modulated m quadrature 
with double sideband techniques and suppressed carrier This is similar to 
the technique used in phase modulation By using coherent signal detection 
with the carrier frequency at the centre of the band the effects of delay 
distortion are reduced 

(b) Use duobwary signalling with differential encoding of the data 


13.7 TRANSMISSION CHARACTERISTICS OF DATA CIRCUITS 
13.7.1 Introduction 

This section desenbes the characteristics of physical and derived VF circuits in 
the communication network and how the impairments of these circuits can 
effect data transmission There are two basic classes of characteristics 

(a) Passive charactenstics, such as the fixed make-up of the transmission line 
in, for example, its bandwidth frequency response, and attenuation 

(b) Dynamic characteristics which arc caused by external influences such as 
noise, phase jitter or loss of synchronism 


13.7.2 Passiv e Characteristics of VF Circuits 

CCITT recommendations M1020 and M1050 specify the ideal frequency 
response and group delay limits for a VF data arcuit These specifications, which 
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were detailed by the sixth plenary assembly in 1976, supersede the earlier 
recommendation M102. 

A power level of — 10 dB below normal speech level has been specified as the 
sending level for data transmission when a continuous tone is being transmitted 
to line. This level has been selected to prevent overloading of multichannel 
communication circuits which are used for both data and speech circuits. 
Although speech circuits can be set to transmit at OdBmO the average data 
level is between — 10 and — 15 dB. 


Equalizers 

Since speech is not sensitive to delay distortion, the telephone network has 
largely developed without this parameter being specified, but when data are 
transmitted over the network the delay distortion can very seriously degrade 
the signal especially for high data rates. Some form of compensation for the 
delay will then be needed. The effect of group delay distortion on data trans- 
mission is to cause interference between adjacent symbols of the received data. 
Normally this intersymbol interference is insufficient to cause errors by itself 
but rather it makes the data more susceptible to noise disturbances. 

As linear distortion can cause intersymbol interference, it is better to adjust 
the equalizer to minimize this interference at the sampling time of the receiving 
data modem rather than attempt to fit the frequency response of the channel 
into some arbitrary criterion. Because the intersymbol interference is more easily 
represented in the time domain than the frequency domain, the form of the 
equalizer that best minimizes the intersymbol interference is a transversal filter 
which gives weighted versions of the original signal at a range of delays. In fact, 
it is this form of equalizer that lends itself to being made adaptive by observing 
the intersymbol interference in the data at the receiving terminal and adjusting 
the weighting of the various delays to minimize this interference. 

13.7.3 Dynamic Characteristics 

The noise, phase jitter frequency errors and other characteristics are also 
specified in CCITT recommendation M1020. The main dynamic characteristics 
are as follows: 


Random noise 

This is wideband noise generated by thermal effects and power induction and 
which usually remains at a fairly steady value with respect to time. The 
recommended limit is —38 dB. Figure 13.13 shows typical random noise 
limits, and Figure 13.14 shows the relationship of SNR to error rate. 
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Circuit langlh (km) 

Figure 13 1 3 Random noise circuit performance 


Impulse noise 

In contrast to random noise this is quite variable with respect to time and can 
depend upon the time of day and the routing of the circuits Sources of impulse 
noise are inductively or capaatively coupled dialling impulses and transients 
produced m exchanges by the operation of electromechanical switching equip* 
ment Data modems can be designed to withstand the effects of this coupled 
impulse noise provided the network is maintained to specified limits The 
recommendation is that the number of impulse noise peaks exceeding -21 dB 
should not be more than 18 in any 15-minute period 


Phase Jitter 

This IS the regular or cyclic variation from correct phase of the received line 
signal Low phase Jitter depends upon good design and maintenance in newer 
equipment tighter design specifications have been used Phasejitter must be less 
than 15° peak*to-peak 


A phase hit 

This is a sudden jump in the phase of the line signal received by the modem 
These sudden jumps are generally caused by the switching of radio bearer 
circuits from normal to standby and vice versa Phase hits are reduced by good 
design and good installation practices which ensure that, for example, there will 
be no dilTerence in the propagation time between radio bearers 
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Signal -to -noise ratio {white noise) (dB) 



Signol-to-noise ratio (pulse noise)(cl0 


Figure 13.14 Relationship of signal-lo-noise ratio to error rate: (a) white noise; 

(b) pulse noise 


Frequency asyncbronism 

This is due to differences in systems carrier supplied, for example a 1000 Hz tone 
sent into a channel may be received at the distant end as 1005 Hz (the usual 
limit). This distortion causes serious errors in low speed modems using frequency 
shift keying. The solution to this is good maintenance practices with respect to 
the synchronization of carrier supplies. 
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Generally the higher the speed the more significant is the effect of the line 
parameters Hence there must be greater attention gi%en to the application of 
transmission pl annin g principles and maintenance and operation practices as 
the speed of transmission increases 

The characteristics of wideband circuits and the expected error rates for 
\anous SNR’s are discussed in detail in CCITT (1972) 

13^ ERROR DETECTION AND CORRECTION 


13^ 1 Introduction 

On all data links there is ah\a>s a possibility of the generation of errors This 
section descnbes some methods of error detection and correction after the 
signal has been translated back into its bmary code The three mam methods of 
error detection used are as follows 

(a) Ciiaracter parit\ checks where an extra bit is sent for each character 

(b) Block parity checks w here parity characters are also sent at the end of each 
block of data. 

(c) Special data codes where extra redundancy is introduced mto the data 
using special code combinations which provide checkmg facilities 

13^.2 Character Pant) Bits 

The CCITT No 5 alphabet contams seven bits of information and one panty 
bit (the eighth bit) TTie parity bit can be either even or odd panty For even 
pant) the eighth bit is set to either 0 or Iso there are an even number of I’sm the 
character Character panty checkmg can be used for asjmchronous or s)!! 
chronous transmission and some receiving terminals will indicate when a 
character is in error The sj'stem wiU not work if more than one error occurs in 
a character or if a burst of errors or a short break in transmission occurs 


13.83 Block Pant) Checks 

With synchronous transmission a block of characters is sent at a time Sjn 
chronizing characters are sent at the start of the block and a panty check 
characterissentattheendoftheblock Thissystem usesa two-coordinafepanty 
check A panty bit is allocated to check each character or column and a panty 
character at the end of the block acts as a longitudmal check and indicates if an 
even or odd number of Ts occur for each horizontal row in the block (see 
Figure 13 15) 

If one error is indicated on the horizontal and vertical axes it is possible to 
detect w hich bit is in error and to correct this bit Bit errors cannot be detected 
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Figure 13.15 A fixed format of data; 
nine seven-bit characters. CP, ‘column’ 
parity bits; RP, row parity bits 


if four errors occur on two horizontal and two vertical axes and they cancel out. 
The probability of quadratic errors of this type occurring is very low. 

Error correction is not often used in block transmission on normal trans- 
mission circuits. It is not possible to correct more than one error in a block and 
errors often occur in short bursts. 

The usual procedure is for the receiving terminal to request retransmission 
of the data block by sending a short message to the transmitter over the back- 
ward signalling channels. 

Blocks of data transmitted over a high speed data circuit are usually numbered 
in sequence so the receiving terminal also checks block sequence numbers to 
detect long line breaks. The backward channel is used to acknowledge every 
block or every group of blocks. 

13.8.4 Determination of Optimum Block Size 

The terms given below are used in the formulations following ; 

w number of data bits in the forward message 
n total number of bits in the forward message 
S number of supervisory and synchronizing bits in forward message 
K number of error detecting bits in the forward message 
9 number of bits in the backward path acknowledgement message 
Df propagation time of the forward data path 
propagation time of the backward data path 
B[ data speed of forward channel 
data speed of backward channel 
C computation time to validate the block 

The redundancy factor can be expressed as the percentage ratio of control bits to 
data bits. The redundancy factor is given by 


m 
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The redundancy can be made up of panty bits, start and stop elements for 
asynchronous characters, block control characters, and other non-message 
needs When data are sent over high error rate circuits (e g space probes) a high 
redundancy factor is necessary to ensure error free transmission 
The optimum block size will depend on 

(a) The bit error rate of the channel which will determine the number of 
retransmitted blocks 

(b) The redundancy factor of the block 

(c) The transmission speed which also determines how long a block is to be 
stored at the sending terminal before it is acknowledged and discarded 

(d) The loop propagation time of the channel including the time of the 
acknowledgement on the return path 

(e) The message length, there may be more than one block per message and 
some blocks will only be partly filled with valid data 

(0 The storage capacity in the transmitter to hold blocks while waiting 
acknowledgement 

(g) The message traffic rate for the channel which determines what spare time 
IS available to repeat blocks 

The storage time T to hold a block after transmission while waiting acknowl- 
edgement IS given by 

T = Df + Df, + g/B[, + C 

In order to simplify the logic circuits, it is desirable that the decision on the 
block validity be made at the sendmg station during the transmission of the 
subsequent block which would also have to be stored, then T should be less 
than nIDf The total holding time including forward transmission is 

T + n/Bf 

The storage capacity necessary would then be for two blocks of minimum 
length n as determined above 

This is an important factor in the determination of block length for medium 
and high speed systems However, no definite recommendation is possible until 
the speed of the forward and backward channels, the amount of redundancy 
(that IS, the error-detection code arrangement), and the distance over which the 
system is to operate are each determined 

13 8.5 Probability of Block Errors 

The probability of an error occurring in a block or a message is given in the 
approximate formula 


= nPb - i«(n - l)Pi 
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where Pb is the probability of bit errors and is the probability of a message 
error. 

The effective speed of transmission for different data speeds and block sizes 
is shown in Figure 13.16. This takes into account the turnaround response and 
block retransmission due to errors and control characters. 

An error rate better than I bit in 10^ can be expected on a normal 9600 bit/s 
circuit. It is best to design for a higher than normal error, as is shown in Figure 
13.16. 

13.8.6 Other Types of Error Detection and Correction 

The systems of character parity checks and block parity checks are satisfactory 
over normal communication channels with a bit error rate better than 1 bit in 
10“*. High error rate circuits, such as low performance radio links, satellites, and 
space probes, require a higher level of redundancy. The technique is to select an 
error detection and correction code to give the most reliable operation with the 
least amount of redundancy. Bennett and Davey (1965), CCITT (1964, 1972), 
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Figure 13.16 Effects of line errors 
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Due (1973, 1974, 1975), and Lucky et al (1968), describe various codes which 
can be used Only the following few examples can be described here 


The m out of n code 

For this code n bits of information are sent but only m I’s must occur in each 
character An error is indicated if more or less than the presenbed m I’s are 
received No error correction is possible In general an m out of n code permits 
nV[m’{n — m)'] combinations out of 2" possible ones 
A typical example of this code is the three out of five code used in multi- 
frequency code (MFC) signalling in telephone exchanges which gives ten 
permissible combinations out of a possible 32 The CCITT standard No 3 
alphabet uses a four out of seven code 


Null zone detection 

The receiver makes a decision to recognize a 0 or a 1 dependmg on the position 
of the signal within an acceptable eye pattern area This can output a positive or 
negative level to a decoder A circuit could be developed to give a third position 
of null or zero if the received signal lies outside certain limits m the eye pattern 
If this method was used m conjunction with standard block parity checks a high 
level of reliability would result as bursts of noise or other disturbances and low 
signalling levels could be rejected 


Interleaved sequence of parity checks (Hamming code) 

Take a character 2" bits long where n is a positive integer Parity bits are alio 
cated at all bit positions 2" where m is positive with all values from 0 to n The 
n -H 1 panty bits are set to provide even parity on selected bits m the word such 
that panty bit x starts at bit position x and checks x bits counting itself, leaves 
out X bits and checks the next x bits, and so on The final check bit checks all the 
bits in the character The redundancy factor is computed from 

n + 1 

2n - (n + I) 

which gives 100% for 8 bit, 45% for 16-bit, and 23% for 32-bit characters 
An example with eight bits per character (n = 3) and four bits of data 
(f? = 100%) IS shown m Table 134 Any single bit errors will show odd parity 
m bit 8 (2") at the receiver and will also appear in some of the other parity bits 
1, 2 or 4 If odd parity is indicated by a 1 the binary value of bits 4, 2, 1 m that 
order will indicate which bit is m error so error correction can be carried out by 
reversing this bit For example if bit 6 was in error bits 8, 4, and 2 will have odd 
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Table 13.4 Code for single-error correcting and 
double-error detecting 


Decimal 

number 



Position in sequence 
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1 

0 
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1 
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1 

0 

1 

15 

1 

1 

1 

1 

1 

1 

1 

1 


parity and the binary value of 4, 2, 1 (110) indicates that bit 6 must be reversed. 
An error will be indicated if two bits are in error as one of the parity bits will 
always be odd but no error correction can be made. Three errors cannot always 
be detected. 

This system, if incorporated with block transmission and longitudinal parity 
checks would be effective on high error rate circuits where one or more errors 
can occur on each block. Quadratic errors would also be detected. In most cases 
the error could be corrected thus eliminating the unacceptable time wasted 
caused by retransmissions on these high error circuits. 

There are many varieties of interleaved error correction codes and many 
complex systems have been developed. The main aim is to achieve the greatest 
reliability with the minimum redundancy. Although these systems are seldom 
used on normal data transmission circuits where block error checking is suffi- 
cient there could exist special applications for use with low performance circuits. 
Some special cases, such as control data to stored-program controlled SPC 
e.xchanges where error free transmission is essential, could be considered. 


Majority logic codes for error correction 

By using parity indicators for different combinations of row and column checks 
in the same block of data more than one check can be carried out on each bit 
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in the matrix A decision to change a bit can then be made on the indication of 
the majority of the tests 

Research has been earned out on vanous combinations for majority logic 
codes to get the greatest number of checks with the minimum redundancy 
For further readmg on all error detection and correction procedures see 
Bennett and Davey (1965). CCITT (1964, 1972), Due (1973, 1974, 1975), Lucky 
et al (1968), and Martin (1972a, b) 


13,9 MESSAGE SWITCHING SYSTEMS 


13.9.1 Introduction 

Message switching networks, sometimes known as store and forward nets^orks, 
are practical when one-way transmission of messages is all that is necessary 
The main advantages over line switchingare that speed and code conversions are 
practicable and no time is lost at the input terminal in attempting to set up a call 
to an outstation which may be busy Disadvantages are the costs of providing 
message storage facilities and any special control procedures to ensure that 
messages are not lost 

Design considerations for message switching networks include 

(a) Signalling speeds and limits of the switching system 

(b) Maximum message storage capacity of the switching centre 

(c) Maximum permissible delay m transferring messages (cross office delay) 

(d) Multistation and polled working requirements 

(e) Special control procedures with the outstalions, such as VDU procedures 

(f) Address and message envelope structure 

(g) Control station requirements for diverting and holding traffic, retrieving 
messages, message logs, and traffic reports 

(h) Fall back procedures in the case of arcuit and switching centre facilities 

Digital computers would mainly be used for control of all future message 
switching centres, along with some form of magnetic or logic storage of the 
messages Systems could consist of one of more major message switching centres 
or a network of major and minor centres If many oulstations are required, 
particularly in the case of low traffic, the transmission and fine terminations 
costs at the major centres lead to the need for small, low capacity, secondary 
switching centres or message concentratore In these cases the remote centres 
could direct the traffic on higher speed, or more heavily loaded, circuits to the 
mam central switching centres where most message analysis, editingand message 
storage would take place Low level local switching could also be provided if 
necessary. 
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13.9.2 Message Security 

There is a saying that a message is not lost if it is known that the mcssag.o is lost, 
The originating terminal can always repeat the message if a lost oi’ invalid 
message alarm is received. With point-to-point and circuit svvitcliing sysloms 
the sender can confirm delivery if some response or acknowlcdgomenl is rcceivc'd 
after transmission, for instance as in the answerback message of Telex macliiiies. 
On links between computers and more sophisticated terminals, link proeediires 
with block transmission, error detection, and block acknowlcdgenieiU can he 
used. These link procedures are used for the transfer of messages between 
message switching centres but cannot be used on the circuits to low speed 
terminals. With store and forward switching there is no direct response in the 
return direction from the receiving terminal to the sending terminal so ;i|)eeial 
message handling and monitoring procedures arc required. Some jnethnds 
are; 

(a) Sequential numbering of messages. The switching centre will aiilomalically 
check the sequence numbers of incoming messages and generate an alarm 
if there is a break in sequence. Sequence numbers will be generated on 
messages from the centre to the outstations. 

(b) Character parity checks. These are applied on all characters in the messag.e 
(if alphabet No. 5 used) with alarm generation when parity error', a;e 
detected. 
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13.93 Control Station Functions 

Control stations are required on message switching networks to handle the 

alarms and perform various traffic supervision functions Some functions are 

(a) receipt of alarms as described in Section 13 9 2, 

(b) open and close circuits to traffic, 

(c) divert traffic to other arcuits, 

(d) obtain traffic reports, number of messages, queue sizes, 

(e) obtain circuit status reports, 

(f) get message logs of input and destination of messages, 

(g) retrieve messages from the message storage area, 

(h) perform accounting functions 

13.9.4 Message Concentrators or Low Traffic Switching Centres 

It IS often the practice to provide some form of buffering between low traffic 

outstations and message switching computers The advantages are 

(a) The occupancy of the mam switching computer is reduced as time is not 
spent in handling and storing low speed messages on character by 
character basts 

(b) The polling, circuit control, and local alarm procedures can be handled in 
the concentrator 

(c) The costs of leasing long, low speed, low traffic circuits are reduced 

(d) As the local terminals are connected over a short circuit, both way working 
could be economical and the concentrator could send status alarms and 
responses to each message directly to the outstation Failure of these 
responses would indicate a fault condition, shortage of buffers or more, and 
the outstation would not send further traffic 

(e) Local switching of messages which do not require processing m a central 
switching computer is possible, however, most concentrators would have 
limited storage 


13.9.5 Traffic Calculation 

Methods used to dimension a message switching network are detailed m 
Chapter IV of (Martin, 1972a) 

By using the traffic data pattern, calculations can be made on the response 
time for interrogation and response traffic, queuing calculations, number of 
terminals, grade of service on junctions, message distnbution and buffer 
requirements, multidrop traffic calculations, and other applicable design 
parameters Additional published material of relevance to the preceding part of 
this chapter includes Abramson and Kuo (1973), BSTJ (1975), Bell Telephone 
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Labs (1970), Martin (1967), McKinnon (1970), McKinnon et al. (1977), Miller 
(1978), Schwartz (1977), and Smith (1971). 


13.10 SIGNAL BEARERS 


13.10.1 Bearers as Subsystem Elements 

Measurement systems consist of basic analog and digital subsystems inter- 
connected to provide the required overall input-output relationships. It is 
important that the various subsystems be interfaced correctly if they are to 
perform as intended. But even with this condition satisfied, it should not be 
assumed that subsystems can be connected together without need to consider 
any other parameters in the interconnection process. 

In practice the individual subsystem assemblies may be geographically 
separate— such as in the remote control of an offshore oil well by a shore-based 
computer, the recording of test data from a missile, the control of banking 
accounts by a central computer centre or the sensors of a refinery which send 
data to the central control room. Each of these require some form of data 
transmission system— in the instrumentation sense these are called telemetry 
systems. 

When making connections it is also important, especially when noise sources 
are present that will interfere with the signal, to ensure that the signal is trans- 
ferred from stage to stage without significant noise pick-up or signal degra- 
dation. 

The bearer methods used should be considered as subsystem elements 
requiring as much consideration of their properties as is given to any part of 
the total measurement system. 


13.10.2 Types of Transmission Bearers 

Although there does exist the occasional application where data can be sent 
conveniently over mechanical or hydraulic forms of data channel, by far the 
most common method makes use of the electric form of signal. Either analog or 
digital information formats can be used on each of the kinds of electric signal 
bearer, the main criterion for selection being whether the chosen bearer method 
has adequate frequency bandwidth to convey the appropriate frequency spec- 
trum required to be received. 

The theoretical understanding and design methodology for electric signal 
bearers matured some years ago for analog signal working, the trend to greater 
use of digital signals being where the greater emphasis has been placed in more 
recent times. 
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The need to pro\’ide greater message security and wider bandwidths of 
operation has created interested in optical fibre communication methods In 
the late I970*s experimental field ex'aluation systems were being brought into 
ser\ice on selected telephone routes and had been introduced in the control 
wiring of advanced military aircraft There can be little doubt that optical fibre 
methods will be progressively used to a greater extent Wolf’s (1979) handbook 
provides a state-of-the art review of this form of technology Other relevant 
works are Elion and Elion (1978X Midwinter, (1979), and Miller and Chynoweth 
(1979) 

Electric signals can be conveyed over bearers in which the signal is confined 
to a physical member— open wires, coaxial cables, and waveguides are con- 
sider^ here Alternatively it might be practicable to make use of electromagnetic 
(EM) or acoustic wave propagation In thecaseofEM radiation it is necessary to 
use a earner frequency much higher than the highest frequency of the signal in 
order to obtain the desired propagation properties Interconnections are dis- 
cussed in Harper (1972) 

Confined stgnai links 

The simplest links are formed using an open-wire circuit (supported on in- 
sulators) or a multicore cable (such as is used in local telephone distribution) 

Although apparently trivial, lines may, m fact, be an important part of the 
system They are not as simple as they first appear because they have a limited 
frequency response (hat must be adequate for the signal bandwidth to be 
transmitted Open-wire lines would not normally be used beyond 10 MHz. 
Above that coaxial cables are needed— these are useful to about 5000 MHz. 
When currents flow in a conducting line, magnetic and electric fields are set up 
around the wires Figure 13 17 shows these plotted for the various kinds of 
cable Open configurations radiaie energy, the amount increasing with the 
frequency of the signal 

A line, IS in reality, a distributed inductance and capacitance component 
which has losses due to the resistance oT the wire and the resistance to ground 
Figure 1318 shows how lines can be considered as a lumped-element equivalent 
circuit which can be analysed more easily 

The equivalent circuit approach to modelling lines is discussed in texts on 
transmission lines, examples being Connor (1972), Johnson (1950), and ITT 
(1970) Models such as that of Figure 13 I8b, provide an efficient approach to 
design for they provide means to evaluate the effect of the various parameters 
of construction on transmission performance 

Depending upon the factors that can be considered to be negligible for a 
particular case the equivalent can be reduced to simpler circuits At very low 
frequencies (less than, say, 100 kHz) a medium length line may be represented 
by the series resistance of the cable shunted by the capacitance of the line (sec 
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Figure 13 19 Reduced models of a Ime 

(a) loT 2 cable operated at low frequenc) 

(b) lossless cable operated at high frequencj 


Figure 13 I9a) Tjpical cables maj ha\e a resistance of around 0 05 m and a 
capacitance of 100 pF'tn. Hence a long length of shielded or open cable could 
proMde a considerable shunting effect that attenuates and phase shifts the signal 
because of the effectise first-order low -pass, filter, so produced. 

When connecting high output-impedance sensors to lines, as bttle as one 
metre of cable ma) be suScieot to attenuate the signal markedl) It ts a matter 
of appljmg Ohm’s law to the suitable equiralent circuiL 

B^use of the filtenng efiects of the cable the higher frequenc> signals trans- 
mitted will be degraded more than the low frequenaes— for example, square 
wa\es become rounded as well as attenuated and phase shifted. The high fre- 
quencj performance of the Une maj be impro%ed b> loading it with mductors 
placed at regular intervals. The inductance value is chosen to tune out the 
mherent capacitive reactance at the upper frequenc} where response begms to 
fall o0^ a method that extends the bandwidth some wa> bejond the mherent 
unloaded upper limit This techmque is used, for example, to broaden the 
bandwidth ofsubmanne cables. 

B} virtue of the surroundmg external shield actmg as the second wire, the 
coa.xiaI cable has no external field and, therefore, docs not radiate energy 
Because of this a well designed coaxial cable will pass from d.c. to microwave 
frequencies— that is, such a cable can have a bandwidth of about 5000 MHz. 
Coaxial cable is, therefore, potentially able to transfer much more mformation 
than open wires It does, however, need a common earth connection (asym- 
metric) and cannot be used m a balanced mode The bandwidth of practical 
coaxial cables is limited by resistive and dielectnc losses. 

^Tien the losses of the line can be regarded as insignificant (G = 0, J? = 0, m 
Figure 13 18b) the lumped-equivalent of the transmission hues reduces to L m 
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series and C shunting, as shown in Figure 13.19b. The net result is, rather 
surprisingly, that the line exhibits only resistance of a fixed value when looking 
into the ends. This called the characteristic impedance, Zq , for which 

Zq = (inductance per unit length/capacitance per unit length)^^^ 

The line appears to be purely resistive and the Zq value is decided by the design 
of the line or cable, not by its length. As a general rule it is the ratio of conductor 
sizes not the absolute size of the line cross section that decides the value of Zq. 
Examples are 600 fi telephone lines and 75 Q. colour TV coaxial feeder cable. 
This means, in practice, that we can interconnect units on the basis of matching 
all connections to the Zq value of the cable without having to worry about the 
cable length. If this rule is observed, no high frequency energy will be reflected 
at the termination to change the information being transmitted. 

However, if the line is very long matching must still be applied to obtain 
maximum transfer, but account must now be taken of losses. For example a 
typical 75 fi coaxial cable will have losses of the order of 2 to 5 dB per 100 
metres of length. 

Details of parameters of the various forms of cable are obtained from manu- 
facturers. In-house publications of the major national telecommunications 
administrations, such as British Telecom and Telecom Australia, are an excellent 
source of data where they are available. Practices related to cabling are covered 
in Harper (1972). 

When greater bandwidth than a nominal 1000 MHz is required multiple 
coaxial cables may be used if the bandwidth can be split. In applications needing 
higher bandwidth for a single signal it becomes necessary to make use of 
waveguides. 

Waveguides consist of precise pipework and convey travelling electromagnetic 
waves of very high frequency along the internal cavity. They cannot be used for 
low frequency transmission. 

The cross-sectional area needed for a waveguide is inversely proportional to 
the design frequency. As a general rule of thumb the upper frequency limit of a 
waveguide is where the wavelength of the signal becomes one quarter of the 
guide aperture, millimetre wavelength signals (50 HGz or so) being the practical 
upper limit. 

Beyond this, a still wider bandwidth is obtainable theoretically using optical 
fibre transmission elements which will pass radiation in the visible light region 
(lO^'*' Hz to 10^® Hz). At the current state of technology, however, it is not 
possible to detect radiation cycles beyond far infra-red signals (around 10^ ^ Hz). 
We cannot, as yet monitor individual cycles of light with electronic detectors. 

Optical fibre transmission operates through modulation of light by high 
frequency signals. Waveguide theory and technology is explained in Collin 
(1966), Connor (1972), ITT (1970), Johnk (1975), Kennedy (1977), Ramo et al. 
(1965), and Saad (1971). 
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Radiation links 

Eleclncal signals fed into open wires radiate energy out into the surrounding 
medium As well as this radiated energy there also exists a near field that remains 
established, storing energy This is the field associated with, say, an electro- 
magnet As the frequency rises, the ratio of radiated energy to stored energy 
increases For this reason it is possible to build elTicicnt radio systems prOMded 
the frequency is kept above lOQ kHz or so Lower frequencies can be used as 
transmission systems but the power input needs nse enormously for the same 
distance radiated in free space (The Omega navigation system uses extremely 
powerful VLF signals because of its reliability of reception and the ability to 
penetrate deep into the waters of the ocean ) Beyond the gigahertz frequency 
region, circuitry becomes impracticable with current technology Even though 
the radiated energy link must operate at a very high frequency in order to 
operate efficiently it might not necessarily need to use the bandwidth available 
on the earner 

Free space electromagnetic radiation transmission is a highly developed 
subject Its theory and practice are presented m many publications, a selection 
of which IS Hamsher (1961), ITT (1970). Kennedy (1977) and Picqucnard 
(1974) Radio engineering as a total system is covered in Ross (1980) 


Skin effect 

The alternating magnetic field produced around a wire has the effect of causing 
the current flow mg in the wire to flow at a greater density in the outer region of 
the wire the higher the frequency the more pronounced is this so called skm 
effect At very high frequencies so little current flows in the centre of the cable 
that the centre is often omitted completely, thus a tube is used as a conductor 
The tube is also convenient for passing cooling fluid to remove heal losses For 
example, at I MHz the majority of the current flows m a copper cable to a depth 
of only 60 pm whereas at 60 Hz the depth would be 8 6 mm This also means 
that the effective resistance of a wire nscs significantly with frequency— by 
factors of 100 


Optica! and acoustic, free space, links 

Visual-optical and infra-red beams are sometimes used as Imc-of-sight earners 
carrymg information as modulation of amplitude or of polarization angle 
Commencial opto-electronic communication links are available, one example 
being designed to transmit television signals, plus operator voice channels, over 
relatively short distances Thesefinduscwheremessagesecunty, from deliberate 
intervention or from environmental noise, is needed It is sometimes more 
convenient to set up such systems for temporary use than a cable Imk 
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Acoustic links have also been developed that use carriers ranging from 
10 Hz to 10 MHz. Acoustic links are less directional, for a given level of hardware 
cost, than the optical equivalent, this being a function of wavelength. 

Sources of general information on transmission systems usually contain 
chapters detailing the various kinds of bearers (see e.g. Bell Telephone Labs, 
1970; Hamsher, 1967; Kennedy, 1977). 


13.11 ANALOG COMPARED WITH DIGITAL SIGNAL 
TRANSMISSION 

In theory an analog signal can assume any value between its saturation limits. 
In practice only a finite number of adequately definable states can be assigned 
due to such factors as the uncertainty of noise levels, the non-linearity charac- 
teristics of the system, and the drift performance with temperature change and 
time elapsed. It becomes very expensive to design analog systems with less than 
1 in 1000 error levels. Thus analog systems typically are capable of conveying 
accurate information on from 200 to 1000 levels. 

Digital signals, in the main, operate with the binary two-level system, the 
minimum number than can be used to convey information. 

Consideration of the tolerance to error-producing physical effects shows that 
the two-state signal has a far greater probability of being received intact than 
does the multilevel analog alternative but that, because the 0 and 1 levels are not 
infinitely apart, there is always a finite probability of an error occurring. For 
this reason it is often still necessary to add redundancy into the system to gain 
an increased level of reliability. 

The SNR of the analog signal will always degrade in transmission to some 
extent. It is far less possible to restore the signal, in a repeater, to its original 
state than is practicable for the binary case. 

However, due to the larger numbers of information-bearing states, analog 
systems can convey more information in a given time. Thus an important 
decision to be made in choosing a system is the extent that data rate can be 
traded for error risk. 

Analog signals are often more appropriate as much of the physical world is 
analog by nature. In such cases additional interfacing circuitry will be needed 
if digital transmission is to be used. The same applies to the receiver where 
analog actuators are often present. 

Until the 1960’s analog methods were preferred because, by default, digital 
hardware and methods had not then been as extensively developed. The 
sophistication of digital circuitry then proposed, brought with it high manu- 
facturing cost. The introduction of integrated circuit microelectronic, mass 
production methods rapidly diminished cost as such a significant factor of 
design. 
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Digital and analog systems use the same bearers A link created to transmit 
analog signals of a given bandwidth and voltage or current swing will be capable 
of transmittmg a digital signal transmission provided the bandwidth and signal 
limits are appropriate 

Conversely, however, a system designed to transmit digital signals will 
usually not be suitable for conveying analog signals 
It IS to be expected that the overwheirrang availability of digital transmission 
experience and hardware will considerably influence designers to adopt an 
mcreasmg number of digital methods as time progresses There will, however, 
always be special cases where the analog alternative will be a better choice 
The concept of the data highway has its ongins in the data bus system used in 
digital data processing machines In the late 1960*s this concept was adapted for 
computer control of process instrumentation systems {Collins, 1968) and today 



Figure 13 20 Architecture of data highway based, plant control system (Reproduced 
by permission of Honeywell Inc ) 
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several process instrumentation manufacturers market (see e.g. Figure 13.20) 
digital data highways for use with extensive instrument systems such as are 
found in process plants. 

Dominant reasons for adoption of the data highway approach were the needs 
to reduce installation cost, to increase flexibility to commissioning and later 
adjustments to plant operation, and to improve message reliability. 

As the number of monitoring and control functions rose with development 
of plant sophistication so did the proportionate cost of cabling— it is not 
uncommon to need as many as a thousand control loops. Furthermore the need 
for operator redundancy in information transfer arose to reduce the risk of 
data transmission failures. 

Further advantages of the data highway approach are that the control and 
monitoring functions can be arranged in a distributed manner and that diagnostic 
and control units can be accessed virtually anywhere in the system’s daisy-chain 
highway. To increase security the highway bearer, basically a single coaxial 
cable, can be duplicated and run on different physical routes. 


13.12 INTERFACING 

It is usually necessary to consider how the measurement system block at the 
sending end should be interfaced to the transmission system and how the bearer 
should interface to the receiving end unit. Two parameters of general importance 
are matching and connection configurations. 


Matching 

Three basic matching criteria exist when connecting two stages together; 
Figure 13.21 summarizes these. 

If the need is for maximum power transfer, as when driving an actuator from 
an output stage of an amplifier, the output impedance of the driving stage must 
equal the input impedance of the stage being driven, this being the case if the 
output impedance cannot be made small enough to be considered zero; which 


Signol/energy flow 



Figure 13.21 Summary of matching 
relationships 
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Figure 1 3 22 Chart showing commonly met source amplifier and output device connection arrangements 
(Reproduced by permisson of Siemens Industries ) 
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is the most efficient method of driving a load from a source. When maximum 
voltage transfer is required, as occurs when a pick-up cartridge or other voltage 
generating transducer is used or when measuring a voltage in a circuit, the rule 
is to ensure that the connecting stage has a much higher input resistance than 
the output resistance of the stage producing the voltage signal. A factor of ten 
to one hundred times is usually sufficient. 

The opposite situation, that is, loading a high output impedance stage with a 
low input impedance, arises when the maximum current transfer is required. 

In many cases the appropriate buffer amplifier is required to provide the 
desired matching condition. In certain a.c. coupled systems— those which do 
not require a d.c. path between stages— a transformer can provide an adequate 
impedance match in an economic way. Transformers, however, can have limited 
frequency response and must be chosen carefully to suit the signal requirements. 

In the above it is assumed that the transmission system is ideal. Consideration 
of the characteristics of various bearers shows that it is often necessary to inter- 
face bearers to transmitters and receivers with the appropriate impedance 
conversion. 


Connection configurations 

Output configuration of the various stages involved in instrumentation can take 
many forms depending on how the earth is connected and if the signal is sym- 
metrically or asymmetrically connected. Six commonly encountered source 
output schemes are shown along the top of Figure 13.22. On the left-hand side 
are seven common kinds of amplifier connection (any other form of black box 
could be regarded similarly). On the right-hand side are leader lines that show 
a link between the output of the chosen amplifier and one of the two most 
commonly used instrument connections — fully isolated circuit with case only 
grounded, or one pole grounded to earth. Using the legend, the chart shows the 
applicability of connections between chosen combinations of source arrange- 
ment, amplifier, and output device. Not-possible situations usually arise because 
the earth connection shorts out one of the source arms. 

Transmission bearers must be selected to suit the number of poles and earthing 
requirements for a given case. 

13.13 PROCESS INDUSTRY TELEMETRY 

Process plants such as oil refineries, paper mills, brick kilns, power stations, and 
aluminium refining plants are often monitored by using hundreds of sensors 
connected to the control-room area via analog instrumentation links. These are 
wired using shielded wire or coaxial cable. Because of the extremely high 
electrical noise levels of such plants and the low output signal levels of the 
sensors, these links could pick up significant noise thus degrading the sensor 
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informalioti Over the years process instrument suppliers have, to some extent, 
standardized the design of control systems and their installation The effect of 
noise pick-up by the cable has been reduced by several methods 

One strategy is to superimpose the information signal onto a standing bias 
current or voltage thus raising the wanted signal level above expected noise 
le\els Two systems commonly used transmit the signal range of the data 
through 4-20 mA dc or 10 SOmVdc systems A 0-20 mA system is also 
common Current transmission has the advantage that the circuit is of low 
impedance— a few ohms— which reduces the level of induced noise power 

An alternatise method is to amplify the millivolt level signals produced by a 
sensor to span them in the range 0-10 V or higher, ready for transmission The 
additional unit that interfaces the basic sensor, such as a thermocouple or a 
differential pressure cell, to a hoe with the appropnate sending signal format is 
known as a transmitter 

As a consequence of expanded use of digital transmission, mentioned in 
Section 13 1 1, it is to be expected that the analog methods mentioned above will 
gradually be displaced by transmitters that condition the sensor’s analog signal 
into an appropnate digital form The growing interest in silicon mtegrable 
sensors, those produced by the mass production methods of microelectronics, 
with on-board signal processing and Ibnnat conversion will also bnng about a 
marked change in the physical hardware of sensors and signal transmission m 
situations where a large number of sensors is invohed 

13.14 SIGNAL TRANSMISSION IN EXPLOSIVELY HAZARDOUS 
ENVIRONMENTS 

Often the sensor has to be placed at a location where an explosion could result 
from a spark or excessive overheating of a malfunctioning sensor circuit The 
most obvious way of overcoming this is to place the whole unit in a flame-proof 
enclosure This method, however, has disadvantages the cost is high, and testing 
and maintenance difficult due to the need to shut off the power when the 
enclosure is opened 

Arr aAtenrafrvif, nreTftocf m ibnown as tnttinsic safety As inffammabi’es require 
a specific level of energy to ignite them, explosion can be prevented by ensunng 
that the sensor stage cannot, under any conditions, provide enough ignition 
energy No enclosures are needed and the circuit can be maintained whilst it is 
operatmg Originally the concept was implemented by ensurmg that the sensor 
circuitry could not draw, or produce via storage, more than a specified power 
level This level was found by experiment m a test ng set up for the situation 
mvolved 

A more recent approach is to use safety barriers At the entry pomt into the 
declared hazardous area the cables terminate into a shunting Zener diode and 
attenuator arrangement which ensures that the current and voltage entermg the 
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area are limited to safe values. Figure 13.23 a shows the circuit of a typical Zener 
barrier. Another safety device uses a solid-state closely-coupled electro-optic 
link which provides d.c. electrical isolation between its input and output, the 
information being transferred from a light-emitting diode mounted next to a 
silicon photodiode detector. These ensure that overvoltage or induced earth- 
loop currents cannot enter the isolated hazardous area. 

A wide range of barrier devices is marketed. Figure 13.23b demonstrates, as an 
example, how a slide-wire displacement sensor can be interfaced from a safe to 
a hazardous area using the barrier of Figure 13.23a. 

It must be made clear that there exists a bewildering range of intrinsic and 
other safety codes and that they differ widely from country to country. Barriers 
designed and manufactured by specialist companies (see e.g. MTL, 1976), are 
to be preferred to constructing one’s own circuitry— in many cases it is manda- 
tory that the device be approved by the appropriate approvals body. 

Other methods are also used, including use of a pressurized or purged en- 
vironment around the sensor so that flammable vapour cannot come into the 
sensor region. Another approach is to design the equipment so as to decrease 
the risk by reducing operating temperature levels using contactless devices and 
the like. A brief, but useful, introductory review of this subject is provided in 
Jones (1974) with a more extensive treatment being available in Maeison (1974). 
The lie A symposium proceedings (IICA, 1980), although centred around 
Australian standards (which are based on British sources), contains useful 
papers on intrinsic safety practice. 


(a) 


To 
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(b) 



Hazardous 

oreo 


Sofe 

areo 


Figure 13.23 Shunt diode safety barrier; (a) one form 
of basic circuit; (b) use of units of (a) to couple slide- wire 
potentiometer into hazardous area 
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In areas where explosion or other catastrophic effects might conceivably occur 
It IS the transmission bearers that need protection It is not uncommon to make 
use of pressurized non-combustable gas in cable ducts or to place cables in 
sealed ducts The case of multiple cables on different routes between the same 
nodes of a system has already been mentioned 
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P. ATKINSON 


Closed-loop Systems 


Editorial introduction 

Measurements are clearly important to the formation of closed-loop control systems. 
This chapter, however, presents closed-loop system concepts from the opposite viewpoint 
because many instrument systems use closed-loop systems to improve performance in 
such ways as are seen in the feedback-connected operational amplifier and in the improve- 
ment of the time response of actuators. This review presents the elements of the more 
traditional linear control methods as these are very applicable (but too infrequently 
applied) to instrument systems design. It also includes discussion of the closed-loop 
digital systems now becoming popular due to the wholesale acceptance of the microproces- 
sor as the prime data processor in even the simplest of instruments. An excellent comple- 
mentary review, listing many instrument examples, is provided in Jones (1979). 


14.1 INTRODUCTION 

Modem closed-loop control systems can be traced back to James Watt who 
invented the first rotative steam engine (patented in 1781). Initially the speed 
of these engines was controlled manually using a throttle valve on the steam 
inlet. In 1788, Boulton, Watt’s co-principal in a company manufacturing the 
new steam engines, had visited the Albion Mill and was able to describe to 
Watt a form of centrifugal governor being used to regulate the grinding speed 
of the mill-stones. Watt soon adapted the governor to measure and control the 
speed of his steam engines automatically (Atkinson, 1968). Since those days, 
closed-loop control and measurement have been inexorably linked; indeed 
to control a physical quantity accurately and rapidly in the presence of changing 
demand, changing internal parameters, and changing load, measurement of 
the control quantity coupled with feedback control is essential. Such measure- 
ment can, of course, only be accomplished by means of instruments. A further 
development down this path is that feedback itself is being used more and more 
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m measuring instruments themsekes to improve the accuracy of measurement 
This notion is discussed further m Section 14 II 

In the nineteenth century, vast numbers of regulated steam engines were in 
operation m factories throughout Bntatn These closed-loop systems were by 
no means perfect in that the precision of speed control was often rather poor 
Engineers, with no real insight into system dynamics, attempted to improve 
performance by making smaller, lighter, and better lubricated governors 
Much to their consternation this normally led to an unforeseen associated 
difRculty, the steam-engine speed then tended to oscillate or hunt continuously 
about the demanded value This particular form of mstahility, as it is now called 
has plagued designers of feedback control systems of all kinds ever since The 
tremendous practical importance of this difficulty stimulated no less a person 
than James Clerk Maxwell (1868) to investigate the problem in great depth 
He brought mathematical insight to bear by relating the existence of instability to 
the presence of positive real parts in the complex roots of the characteristic 
equation for the system (see Section 144) 

The 1914 18 war caused military engineers to realize that to wm wars it is 
necessary to position heavy masses (e g ships and guns) precisely and quickly 
Classic work was performed by N Minorsky (1922) m the USA on automatic 
ship steering and H L Hazen (1934) defined a servomechanism for the ^rst 
time The concepts of automatic control, as they developed, are covered in 
Bennett (1979) 

The problem of mechanical position control provides an ideal example to 
illustrate the need for feedback in control Let us suppose we have the problem 
of controlling the angular position of a heavy rotatable mass, the resource of 
mechanical or electrical power assistance rather than total reliance on muscle 
power allows us the obvious advantages of rapid control To simplify the 
problem, let us suppose that we have an ideal fnctionless electric motor at 
our disposal and to achieve maximum acceleration of the rotatable load we 
couple the motor to the load through an ideal fnctionless stepdown gearbox 
It will be assumed that the motor produces a torque at the load which is directly 
proportional to its supply voltage In order to control the supply voltage to 
the motor we connect it to an ideal power amplifier which receives at its input 
a control voltage Uj which is directly proportional to an angular positional 
signal 0,, this signal is applied manually through a light handwheel connected 
to a position-to-voltage transducer (such as a rotary potentiometer) The 
notional arrangement is illustrated m Figure 14 1 , this system will produce 
rapid acceleration of the rotatable mass in response to small and effortless 
motion applied manually to the handwheel When the handwheel is at a nominal 
zero position the mass will cease accelerating, a change m the handwheel 
position in one direction will produce acceleration in one direction and a change 
m position in the opposite direction will produce acceleration in the other 
direction A simple mathematical analysis is as follows 



CLOSED-LOOP SYSTEMS 


593 


a 

U 

\ 


Figure 14.1 Position control system without feedback 

Let the effective moment of inertia of the moving parts, referred to the 
position of the rotatable mass, be J. Also let 

Vi = kffi 

v^ = KVi 

and the effective torque acting on the mass T be given by 

T = k^v^ 

where fc,, and k„ are constants. Thus 

T = kMOi = KOi 

where K is a composite system constant. Now in the calculus notation, according 
to Newton’s second law of motion 


The response to a small change in 0; from zero to a fixed positive angle is shown 

in Figure 14.2. . . , 

In order to change the position of the mass from one fixed position to another 
in the shortest time we must first accelerate the mass and then retard it in sue a 
way that at the instant the required position has been reached, both the ve ocity 
and acceleration are simultaneously reduced to zero. The manua con^o 
problem is extraordinarily difficult; it is analogous to steering a car wit a 
trailer backwards along a desired path. Worse still, in many circumstances 
(e.g. control of the position of a gun turret) there will be load distur ance 
caused, for example, by wind gusts which will cause the mass to deviate rom e 
desired position in a random manner. 
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Figure 14 2 Response of position control 
system without fe^back 


It would appear that the position control problem might be solved similarly 
to the way in which Watt solved his speed control problem, that is, by means of 
feedback of a measured value of the controlled variable The controlled vanable 
0, can be measured by means of a transducer identical to that monitoring 
so that a signal (given by Vo ^ k,v^ is available for comparison with Vi 
The notion is now that if 0| is made equal to the required position of the rotatable 
mass, the amplifier can be fed by a difference signal (p, - uj which is given by 


c, - t)<, = k,0, “ k,$^ = k,c 

where £ is defined as the posiiionat error between the required position 0, 
(termed the command or input) and the actual or output position 0, This error 
signal IS now amplified as before and applied to the motor Thus a dnving 
torque will always be present as long as 0„ is different from 0( When they are 
the same there will be no dnving torque and the mass will hopefully stop 
moving at the point where we want it to be The notional practical arrangement 
is showm in Figure 14 3 An analysis of this system is very revealing 
The effective torque T acting on the mass is no longer KOi but is now equal 
to Ke Thus again applying Newton’s second law, we have 


But £ = 0j — Oe, hence 




= K(0, - 0„) 


d*0„ 

J-^ + KO^^KOi 


and so 
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This will immediately be recognized as the equation for simple harmonic 
motion. Hence a sudden displacement in the handwheel position will lead to 
continuous oscillations as illustrated in Figure 14.4. Mathematicians define 
this system as critically stable because its oscillations will neither increase in 
magnitude nor decay; control engineers tend to regard such a system as un- 
stable. This system is exhibiting exactly the same type of behaviour as Watt’s 
regulated steam engines and is entirely unsatisfactory for practical position 
control systems when compared with what can be achieved. 

In non-ideal practical systems there is always some friction present which 
always acts against motion and this will cause the oscillation to decay event- 
ually. There are, however, various forms of friction including stiction (the 
torque necessary to just cause motion). Coulomb friction (a constant torque 
independent of velocity) and viscous friction (a torque which is directly pro- 
portional to velocity). Stiction and Coulomb friction both cause undesirable 
side-effects (stiction producing stick-slip motion when the system is commanded 


Q? 



Figure 14.4 Response of position control system with feedback 
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Figure 14 5 Position control system stabilized with velocity feedback 


to follow a constant velocity input and Coulomb friction producing a constant 
offset in response to a constant input) It is thus essential to minimize suction 
and Coulomb fnction by the correct mechanical design and to ensure that the 
VISCOUS component dominates or that a similar dominating effect is reproduced 
by other means The effect of viscous friction on the differential equation of 
motion of the system is to add an extra term proportional to output angular 
velocity thus 


J 


dt* 


+ f -r^ + = Ke, 

dt ' 


where F is the viscous frictional torque per unit angular velocity Practically 
one may achieve the required viscous dampmg by either attaching a physical 
viscous damper to the rotating mass or by feeding back an extra signal (derived 
from another transducer — this time a tachogemrator) which is directly pro- 
portionaf to angufar vefocity Tfie second of these arrangements is shown m 
Figure 14 5, electric motor-tachogenerators in single units are commercially 
available to facihtate the scheme The electronic differencing circuit will now 
produce a signal 


kE-k^ 

' * dr 


where k, is the tachogenerator constant Hence the drive torque T produced by 
the motor is given by 
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Again assuming that all forms of friction are negligible and using Newton’s 
second law: 

k(ks-k^ 

= Ke- k„k,K^ 


Hence 


J^ + k^k^k,^ + K6, = Ke, 


dr 


The term k^k^k^ can be regarded as the equivalent viscous frictional constant F. 

The analysis of the response of this system to various inputs is important 
for two reasons; firstly the arrangement forms the basis of many recording 
instruments used in practice (e.g. X-Y recorders and X-t recorders) and 
secondly it represents the embodiment of the second-order system which is 
used as an important reference in the design of higher-order systems (see 
Section 14.2.5). The arrangement constitutes a basic position control servo- 
mechanism. 


14,2 DETERMINATION OF THE DYNAMIC BEHAVIOUR OF A 
CLOSED-LOOP SYSTEM USING THE DIFFERENTIAL EQUATION 

14.2.1 The Laplace Transform 

It has been seen how the differential equation of a simple feedback control 
system may be derived by the application of physical laws (Newton’s second 
law of motion in the example given). However, in order to determine the 
behaviour of the system in response to certain inputs we need to have available 
a method of analysis; the method of Laplace transforms (Gardner and Barnes, 
1942; Goldman, 1966 and Chapter 4 of this volume) provides a suitable basis 
for this analysis. 

The Laplace transform of a signal di(t) is formally defined as ©i(s) in which 
s (p is also often used) is the complex variable a + ]co and 



Here c is chosen to be larger than the real parts of all the singularities of 0i(s). 



598 


IIANDBOOk OF MEASUREMENT SCIENCE 


Fortunately there is never any need to evaluate these integrals m practice 
because they ha\e been tabulated in transform pairs to aid the rapid solution 
of differential equations 

In the absence of initial conditions we may transform derivatives by the rule 


dl" 


= s-F(s) 


and integrals by the rule 



in which if represents the operation of taking Laplace transforms and F(s) 
is the Laplace transform of /(t) In situations where the initial conditions are 
non zero, then 

■2’^[/(03 = 5"f(s)-s'-'/(0-)-s" V‘(0-) -/■ '(0-) 

and 

se r/(i)d( = lF(s) + 02z> 

Jo ^ 5 

where/(0— ), /‘(0-), /" ‘{0-) are the values of the function and its 

« — I derivatnes and / " ‘(0 -) is the value of the time integral of /(t) just prwr 
lo the application of the signal at r « 0 It should be noted that the limit t » 0 
IS used here, whereas in rigorous mathematical texts in which the derivatives 
of discontinuous functions are not legitimate functions, the lower limit is quoted 
as t = 0+ However in practical analysis m which the unit impulse function 
(5(f)) IS used, a more consistent methodology results by using a lower limit 
r = 0- 

A short table of transform pairs is included here for reference purposes in 
the examples that follow (Table 14 1) Other tables are given m Chapters 4 
and 6 Notice that when control engineers use the single sided Laplace transform, 
all the drmng signals are considered to operate after t = 0, they are defined 
as zero prior to this instant and this may be conveniently represented as multi 
plying all the time functions by the unit step u(f) 


14 2 2 Anaijsis of the Simple Position Control System with Viscous 
Damping or Velocity Feedback 

Returning to the simple position system described in Section 14 I, it is necessary 
to define the form of the input signal and the important initial conditions 
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Table 14.1 Some functions and their Laplace trans- 
forms 


m 

Fis) = ^[/(t)] 

Unit impulse function 3(t) 

1 

Unit step function u{t) 

1 

s 

Ramp function u(j:)t 

1 

? 

Exponential delay i/(t)e~“' 

1 

5 + a 

Exponential rise u(/){l — e"") 

oc 

s(s + a) 

u(t)r e”“' 

1 

(s -E ay 

i((t)sin(co„ f ) 

«rt 

s^ + 

sin(cUr,t) 

(s + ay -t- 

«(t)e““' cos(cu„f) 

s -E a 

(s -E a) -E 

fit - t) 

e-^F{s) 


Assuming zero initial conditions we may transform both sides of the differ- 
ential equation thus: 

Js^e„(s) + Fse„(s) + Ke„(s) = kq^^s) 

Hence 


©o(^) = 


KQjjs) 

Js^ + Fs + K 


iKfJ)Q,{s) 

+ (F/J)s -I- K/J 

Consider the response to various input signals. 


The unit impulse response 

If 0i(r) = S(t) then ©i(s) = 1, and so 

^ K/J 

~ s^ + iF/J)s + K/J 
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Let US take the case for which F <2y/(JK), so that we must complete the 
square in the denominator, then 


Qo(s) = 


K/J 

(s + a)^ + £0* 


where 


a 


_F 

If 


and 


tj„ 


'K _ 

4J‘ 


This does not quite agree with any of the tabulated Laplace transforms, but 
with a slight modification it yields 




<o„ (s + ot)* + a)„ 
which may be in\erse transformed to yield 
KfJ 

^o(0 = o(t)— ^e~*'sin(tUnt) 


This IS the umt impulse response of the system 


The unit step response 

If 0,(0 = u(0 then 0,(s) = I/s. thus 

This must be broken into partial fractions thus 

* s tJrt (s + g)* + ct»rt (s + «)^ + tUn 
Each term has a recognizable inverse Laplace transform 
a 

0o(0 = t^(0 s*n(g>rt0 “ u(t)e“ cos(w„0 

Wrt 

= u(i){l - + (a/Wrt)*]sin[c)rt( + tan"‘(ajr,/a)]} 


The unit ramp response 

If 0,(t) = 1/(0. then 0,(s) = l/s^, and so 

KfJ 

” s=[s’ + (F/J)s + A'/i] 
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This must be broken into partial fractions and yields a response 


where 


0o(O = m(0 ^ 



Xe~°' 

F(o^ 


sin(cOrj t 



^ = tan ^ 


2aGJ, 


2a^ - KIJ 


The various responses are illustrated in Figure 14.6. The quantities M^^, Tp, 
Ejs, and £p form some of the basic means of performance specification. 


14.2.3 The Concept of Damping Ratio and Undamped Natural Angular 
Frequency 

The value of the damping term F relative to the terms J and K governs the 
dynamic behaviour of the system. There are four possibilities: 

(a) F = 0 in which case the system will oscillate continuously with sinusoidal 
oscillations of angular frequency ^{KIJ). This quantity is termed the 
natural undamped angular frequency. This response is termed critically 
stable. 

(b) F < 2j(JK) in which case the response contains an exponentially 
damped sinusoidal mode and will exhibit overshoot in response to a step 
input. This is called the underdamped response. 

(c) F = 2yJ(JK) for which the response is critically damped, that is, it does not 
quite overshoot in response to a step input. 

(d) F > 2y/(JK) for which the response is a double exponential rise in 
response to a step input. This is called the overdamped response. 

In Section 14.2.2 we concentrated on condition (b) because it is in practice 
the most important case because it allows the system to settle within a given 
tolerance band (Atkinson, 1968) around the desired value faster than any other. 

It is convenient to non-dimensionalize the effect of the damping term F 
by relating it to the value required to achieve critical damping where F^ = 
2f(JK). We define the damping ratio C as 


F, 2j(JK) 

so that for C < 1 we have an underdamped system, for C = 1 a critically damped 
system, and for C > 1 an overdamped system. 
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Figure 14 6 Time domain responses of a stabilized position control 
system (a) impulse (b) step, (c) ramp output, (d) ramp error 


The differential equation may be rewntten in terms ofa)„ and C thus 




The response to a unit step input for vanous values of C is illustrated in Figure 
14 7 
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Figure 14.7 Dimensionless unit step response 

14.2.4 Frequency Response 

The steady-state behaviour of the system in response to a sinusoidal input is of 
considerable practical importance. When we refer to the frequency response of a 
system we mean the variation of the phase and magnitude of the steady-state 
output of the system as the frequency of the input sinusoid is varied over the 
range of interest. 

The Laplace transform of the output of the system is related to the Laplace 
transform of the input by the equation 

s^0o(s) -t- 2C(OnS0o(s) -1- (U^0<,(s) = CJ^&i(s) 

For steady-state sinusoids we may substitute jco = s, where co is the angular 
frequency of the input, and produce the following operational relationship 
between ©i(jm) and ©oQ'cu): 

Qo(jtQ) (4 

®i(jw) iiojf -f 2Ca)„{jco) -F col 


col — +j 2 Cc 0 n(U 

1 

1 - ico/coff + }2Ccolco„ 



Frequency ratio,*/ 


Figure 14 8 Frequency response characteristic (a) mag 
nitude, (b) phase 
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It is rather simpler to work in terms of a non-dimensional frequency ratio 
u = (o/(On for which 

Qo(j«) ^ 1 

0i(j») (l-u^)+}2Cu 

From this expression we can determine the modulus M, its peak value Mpf 
(if any), and the phase (p which are given by 


VCd - uf + dCuf] 

= 2CV(1 - Cd 

and 

The magnitude and phase characteristics are illustrated in Figure 14.8. 
The angular frequency at which the frequency response has its peak value 
is designated by the symbol It may be shown that 

w.f = - 2C^) 


14.2.5 Second-order Correlations 

The time domain (step response) and frequency response of the second-order 
system are connected through correlating equations; these may be combined 
as graphs which are valuable in the approximate design of higher-order systems 
(Atkinson, 1968). These graphs are given in Figure 14.9. 




Figure 14,9 Time and frequency domain characteristics of the sccond-ordcr system 
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143 TRANSFER FUNCTIONS AND THEIR USES 


143 1 The Transfer Function Concept 

In analysing the simple second-order position control system we used the 
Laplace transform to determme the response of the system to vanous com- 
mands In Section 143.3, the relationship between the Lapla« transform of 
the mput to the servo 0,(s) and the Laplace transform of the output is expressed 
m the form 

= (0^0, (s) 

We may recast this relationship in an alternative manner m which the 
transforms are expressed as a ratio called the tranter function 

Oil 

0,(s) + 2C(0^5 + Oil 

This relates the output of the s>stem to its mput in a manner which is more 
convenient when handling complex systems Rather than determine a comph 
cated differential equation or set of differential equations, we may bypass 
this problem by combining the transfer functions of the basic elements by means 
of some very simple rules The overall transfer function may then be used m 
vanous wa>'s m order to anal>se or design the system. A range of transfer 
functions for commonly-encouDiered devices has been denved m vanous 
textbooks (Thaler and Brown, I960,Atkmson, 1968,Towill, 1970) 

It should be understood, however, that although transfer function techniques 
are very useful m the analj-sis and design of moderately compheated systems, 
the methodolog} does have stnet [imitations, for example, it can only be 
apphed to Imear, tune mvanaat systems (or those which are substantially 
linear at them operatmg pomt) with zero initial conditions A very useful 
account of the limitations is available m Truxall (1972) 

1433 Rules for Combining Transfer Functions 

Two basic rules are all that are required for the determination of the overall 
transfer functions of a set of interconnected nonmteracting elements Inter- 
actmg elements can usually be converted mto non-mteractmg devit^s m 
theory by using certam simple analytical techniques (Atkmson, 1968) Indivi 
dual elements in a system may then be represented m a block diagram showing 
the interconnections of the blocks (Figures 14 10, 14 11) For n cascaded, non- 
mteractmg elements, as illustrated m Figure 14 10, the overall transfer function 

= /f,(s)H,(s)H 3 (s) H.(s) 
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Figure 14.10 Elements in eascade 


For the feedback combination illustrated in Figure 14.11 

QM _ H,(s) 

0i(s) 1 -F H,(s)H^(s) 

Alternatively the inverse transfer function 

These transfer functions can be used in assessing the time domain response 
to any deterministic signal for zero initial conditions. The steady-state response 
to sinusoids may always be assessed by the substitution s = jco where co is the 
angular frequency. 



Figure 14.11 Elements in parallel 


14.4 PERFORMANCE ASSESSMENT USING THE TRANSFER 

FUNCTION 

When analysing linear continuous control systems, it is usually possible to 
represent any single input/single output system by a simplified block diagram 
as shown in Figure 14.12. 



Figure 14.12 Notional single-input/ 
single-output system 
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The Signal 0^ niay in fact be a measured lalue of the output in practice rather 
than the physical quantity itself , the reduction of a complex system to a notional 
block diagram of this form is, however, a valuable analytical procedure because 
the concept of unity feedback often reduces urmecessary complication m the 
design process In practice the measured value of the output is fairly tightly 
related to the actual value so that the procedure is normally fully justified 
The open loop transfer function His) can normally be represented by the re- 
lationship 

A'expc-srorif-i + r,.5 + 1 ) 

^ n?-i + 1) 

where 

K = the error constant 
Ti. s= pure time delay 
ni = t>p€ number of the system 
T.„ and are constants defining the leads 
and 7ii are constants defining the lags 
p IS the number of leads 
q IS the number of lags 

It should be noted that some or all of the leads may be quadratic as indicated 
or simple of the form (1 + sT^,), equally well some or all of the lags may be 
simple exponential of the form (I + sT,,) or quadratic Quadratic factors with 
teal toots may be usefully factorized to simple factors 
It should also be noted that although pure time delay is only rarely encoun- 
tered in servomechanisms it is often present m process control systems where the 
transport of material at a finite velocity is a necessary function of the system 
In the analysis of control systems we are concerned with many aspects of the 
behaviour absolute stability, relative stability, response to commands, response 
to disturbances, and the sensitivity of these to parameter changes It should be 
noted hero ihst sa ahsuAiTe^yemsnih&rsyscemtsone ^hoseantpnt ts nnboonded 
for a bounded input, a relatively stable system is an absolutely stable system m 
which the transients in response to a command are not excessive and decay 
rapidly Relative stability is nominally a rather qualitative concept but it can 
be quantified very readily in vanous ways (c g the peak overshoot and settling 
time m response to a step input are quantitative measures of relative stability) 
The closed-loop response is governed by 

Qc(3) H(s) 

0,(s) 1 + His) 

fCexp(-srjnf-i(r..i^’ + 7',|S+ 1) . 

s”nf-i + T.,S + 1) + KexpC-sTjnr-i + T„s + 1) 
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When the denominator of the closed-loop transfer function is equated to 
zero we obtain the characteristic equation of the system. The roots of this 
equation in s govern the modes of the transient response of the system. Each 
negative real root gives rise to an exponential mode of the form A exp(— ajL) 
and each complex conjugate pair of roots with a negative real part gives rise to 
an exponentially decaying sinusoidal mode of the form B exp{—a 2 t)sm{o}^A + 
(j)). The existence of positive real roots or positive real parts of complex roots 
implies an unbounded output for a bounded input and consequent absolute 
instability. For systems containing no pure time delay, absolute instability 
may be predicted directly from the characteristic equation using the Routh- 
Hurwitz stability criterion (Atkinson, 1968; Towill, 1970). The variation of the 
roots with variation in the error constant K plotted in the s-plane is called the 
root locus diagram. The form of the root locus diagram for a system having an 
open-loop transfer function 


" " s(l + sTi)(l -t- ST 2 ) 

is shown in Figure 14.13. 

The root locus concept and its application in the design process have been 
described by various authors including Truxall (1955), Thaler and Brown 
(1960), and Towill (1970). In root locus analysis the transfer function is expressed 
in terms of poles of the form l/[s -f- (l/T^i)] and zeros of the form [s + (l/Tj;)] 
either of which may also occur in pairs with complex conjugate roots. Design of 
well conditioned closed-loop systems revolves around the insertion of extra 
open-loop zeros and/or poles coupled with adjustment of the error constant in 
such a way so as to force the closed-loop poles (governing the closed-loop 
modes) into positions where a particular one (or ones) dominate the response 
completely. Usually, this approach is only partially successful because sub- 
dominant modes tend to modify the behaviour to an extent which is not im- 
mediately predictable. However, the method is an excellent way of handling up 



Figure 14.13 Typical root locus diagram 
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to Sixth-order systems without pure time delay Higher-order systems and those 
having pure time delay are probably best handled by frequency response 
techniques (as explained in Section 14 5) 

The steady-state performance of a stable closed-loop system is determined 
by the type number m and the value of the error constant K Figure 14 14 
gives a summary showing how type 0, 1, and 2 systems respond to various 
typical command inputs 

In a particular application the lowest suitable type number should be used 
because the higher the type number, the more difficult it is to design the desired 
well conditioned system Type 0 systems having a sufficiently large error 
constant will be quite adequate for many regulators where the object is to 
maintain the output at a constant level and tracking is unnecessary Type 1 
systems are perfectly satisfactory where dynamic accuracy is not vital (eg 
instrument servomechanisms, fin servomechanisms on guided weapons) 
Howeier, m systems where dynamic accuracy under tracking conditions is 
vital (e g machine-tool control systems) it may be necessary to use type 2 
systems (or at least type 1 systems with very large error constants) 

The analytical determination of the response of a system (or even its root 
locus diagram) is normally impossibly complex for computation by hand 
Analog or digital or hybrid computer simulation is invariably necessary m all 
but the simplest systems (Atkinson, 1972) 



Figure 14 14 Responses of type 0, 1, and 2 Systems with various inputs 
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14.5 THE FREQUENCY RESPONSE 

Whereas the calculation of the time domain response of control systems is 
extremely involved, even for quite simple systems, the frequency response is 
relatively easy to determine. Moreover the closed-loop stability can be deter- 
mined from the open-loop frequency response by means of the Nyquist stability 
criterion (Atkinson, 1968; To will, 1970; Thaler and Brown, 1960; Truxall, 
1972). Also the closed-loop frequency response of a stable system may be 
rapidly assessed from the open-loop frequency response. 

The open-loop frequency response must be determined from the expression 

(. _ A:exp(-jmrL)nf=o + T,i(ia3) + 1] 

^ (jfu)'” n^o + TM -i- 1] 


It is either necessary to calculate the magnitude and phase of this expression 
or to calculate the real and imaginary parts over the range of frequencies of 
interest. A simple example will serve to show that although this task is simple 
in theory, it is very arduous when performed by hand. 

Consider the open-loop transfer function 

4(1 -b jctf) 

ja)(l -f j0.5co)(l -f jO.lm) 


for which 


and 




4^(1 -b 0 )^) 

mV[l -t- (0.5co)2]V[l + (O.lm)^] 


/H(ico) = tan ^ m ^ — 0.5co — tan ^(0.5m) — tan ‘(O.lco) 

Although the necessary computation required to evaluate these expressions 
is very time consuming when performed by hand, it can be performed very 
rapidly using a digital computer. 

A Nyquist diagram is a Argand diagram on which //(jca) is plotted over the 
required range of frequencies. An inverse Nyquist diagram is an Argand 
diagram on which H~ ^(jco) is plotted. 

A Bode diagram shows a graph of |H(jcw)| expressed in decibels (dB) (i.e. 
20 logiolH(jto)l) versus angular frequency plotted on a logarithmic scale 
together with the phase / H(ico) plotted separately. Notionally, Bode diagrams 
may be plotted more simply than Nyquist diagrams when it is possible to make 
use of asymptotic approximations to the gain characteristic. Frequently the 
occurrence of quadratic lags and leads in transfer functions makes this very much 
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less useful than it ^\ould appear at first sight Nyquist, inverse Nyquist, and 
Bode diagrams representmg the transfer function 

“ Ml + jidTiXI + JwTi) 
are shown m Figure 14 15 for \anous \a1ues of K 


Imoginory 




J 

PhfiM morgr 


'• 






■ 


»«0 



RmI 



Figure 14 15 Frequency response representations for 
OJt = A/[jfl)(l + jcjr,){l + jorj)] (a) N>quist diagram, 
(b) inverse Njquist diagram, (c) Bode diagram 
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The advantage of the frequency domain approach is the simple connection 
between the open-loop frequency response characteristic and the closed-loop 
frequency response. A locus of constant closed-loop modulus M on the Nyquist 
diagram or the inverse diagram is a circle. It is thus a simple matter to assess 
the value of the closed-loop resonance peak Mpf or to determine the value of K 
to give a specified value of Mpf. A selection of M-circles and inverse A^f-circles 
is shown in Figure 14.16. 



Imoginary 



Figure 14.16 M-circles and inverse M-circles: (a) loci of constant 
closed-loop modules M on the Nyquist diagram; (b) loci of con- 
stant closed-loop modulus M on the inverse Nyquist diagram 
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Figure 14 17 Denning Ihe closed-loop resonant peak A/p, 

The resonant angular frequency o)^ in the frequency domain is the angular 
frequency at which the frequency locus just touches the maximum M-circIe 
(Figure 14 17) 

Translation of open^loop data on a Bode diagram to closeddoop data is 
best performed on a Nichols chart (Atkinson, 1968) 

Open^loop frequency response characteristics give good measures of the 
relative stability of the clos^ loop system Systems having gain margins of at 
least 10 dB and phase margins of 40® to 50® (Figure 14 15) have adequate 
relative stability when operated on closed loop A system having a peak value 
of M * 1 3 (Figure 14 17) will also have good relative stability on closed loop 
It IS also possible to make quite good guesses of the closed-loop step response 
from the open-loop frequency response A system having A/pf of about 1 3 will 
have about 20% overshoot and a nsc time to the first maximum of approxi- 
mately 3/ajrf It must be understood, however, that these values are very rough 
estimates based on second-order correlations (Section 14 2 5) 


14.6 DESIGN SPECIFICATIONS 

The original design specifications for a feedback system may be phrased in 
numerous ways depending on the purpose of the system has to serve and the 
expected envelope of commands, disturbances, and internal parameter vana- 
tions Typical specifications may be phrased m terms of 

(a) Step response Time to first maximum (7^) and per unit overshoot p, 
tolerance zone and settling time (Atkinson, 1968), threshold, integral of 
absolute euor, integral of absolute error squared, integral of time x abso- 
lute error (Towill, 1970) The system type number and error constant 
would also normally be specified in conjunction with step response 
entena 



CLOSED-LOOP SYSTEMS 


615 


(b) Ramp response. Ramp peak error, settling time, steady-state error (or 
velocity error constant). 

(c) Frequency response. Resonant angular frequency co^f (or bandwidth); 
resonance peak Mpf', phase margin and gain margin; type number and 
error constant. 

(d) Disturbance response. Maximum permitted output to given disturbances 
over the expected range of disturbance frequencies. 

Normally systems must meet a given set of specifications over the expected 
envelope of system parameter variations and in the presence of transducer and 
signal noise. Systems specifications may be translated from time domain to 
frequency domain and vice versa using second-order correlations which will 
give a very approximate guide to higher-order system behaviour. 


14.7 MECHANISTIC MODELLING AND MODEL ORDER 

REDUCTION 

When the specification for a system has been phrased in a suitable form the 
first task in the design procedure is to form a mechanistic model of the plant to 
be controlled. Mechanistic modelling may or may not be a difficult task, 
depending on the nature and complexity of the plant or process. Approximate 
linear models of frequently-encountered electromechanical, electronic, and 
electrohydraulic systems may be fairly easy to derive (Atkinson, 1968; Towill, 
1970) although there will often be considerable doubt about the precise values 
of parameters. Linear models of some complex thermal and thermochemical 
processes may be more difficult to derive analytically (Maudsley, 1978). It is, 
however, a necessary part of the paper design stage of a control system to form a 
mechanistic model. Once a mechanistic model has been determined it is often 
necessary in all but the simplest cases to form a reduced-order model which is a 
sufficiently accurate representation of the original system to allow certain 
design procedures to progress. There have been various methods proposed for 
producing low-order models (Towill, 1973). 


14.8 IDENTIFICATION 

Although it is usually necessary to form a mechanistic model of the plant to be 
controlled so that a proper understanding of the plant dynamics can be obtained, 
this procedure is often essential because it is usually necessary to design the 
control system before the plant has actually been constructed. However, once 
the plant has been set up it is advisable to check that the proposed mechanistic 
iDodel is actually an adequate description of the plant behaviour. This requires 
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a practical method of identification and there are various possible methods 
(Davies, 1970, Graupe, 1972) including 

(a) the injection of a known impulse, step, or ramp signal and the analysis 
of the response, 

(b) the injection of a senes of sinusoids of various frequencies and the cor- 
relation between input and output, 

(c) the injection of a pseudo-random binary input and the cross-correlation 
between input and output 

Method (b) has some very distinct advantages m that it is possible to identify 
certain non-lmeanties as well as the linear elements from one set of data 
It IS also a relatively simple procedure to fit a transfer function model to a set 
of frequency response data 

Identification is also necessary to ensure that the bread-board system and the 
development prototype are within the required specification and that all 
production models are healthy at the lospection/mamtenance stage 


t4.9 COMPENSATION 

Having determined a mechanistic model of the plant (verified at the earliest 
possible stage by identification) in the form of a transfer function, the next 
problem m the design process is to decide on a control strategy and then 
determine the parameters of the elements required to implement that strategy 
The decision regarding which strategy to use must be based on experience and 
intuition, indeed it may be necessary to investigate various strategies before a 
satisfactory one is obtained The determination of the parameters of the elements 
required to implement the strategy can then proceed using one of the standard 
methods of design (preferably with the aid of a computer-aided design (CAD) 
suite — see Section 14 12) such as the Nyquist diagram, inverse Nyquist Diagram, 
Bode diagram (Atkinson, 1968), root locus diagram (Truxall, 1955, 1972, 
Thaler and Brown, 1960, Towill, 1970) or the coefficient plane (Towill, 1970, 
1975, Ashworth, 1975) 

The most naive approach is to devise a proportional control scheme as 
illustrated in Figure 14 18 It is very rare that any but the simplest specification 


Proportionol 

controller Plant 



Figure 14 18 Simple proportional control 
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Senes 

compensator Plant 



Figure 14 19 Series compensation 


can be met by such an approach and the least one might expect is the need to use 
a series compensator having a transfer function //^(s) such that the overall 
closed-loop system is brought within the performance specification (Figure 
14,19). The steady-state performance specification in response to both com- 
mands and disturbances will normally indicate the type number the system 
must possess (refer to Figure 14.14 for commands) and, therefore, the number of 
extra pure integrations which must be introduced into the loop. 

Typical series compensators have the following transfer functions and 
effects ; 

1 -4- sTi 

(a) H,(s) = K ---- 

1 -t- sTz 

where time constant Tj > T 2 for phase-lead compensation which can be 
used to raise the error constant moderately for increased stiffness, and to 
raise the speed of response; and where Tj < T 2 for phase-lag compensation 
which can be used to raise the error constant greatly for increased stiffness 
and to lower the speed of response marginally. 


(b) 


H.(s) 


m + sT,) 
s 


which is called proportional plus integral (P + I) control or two-term 
control, and can be used to increase the type number and stabilize the 
resultant system. The resultant system will inevitably have a slower 
response than a system with the same plant under proportional control. 


HXs) = K, -F K2S -F 


which is called three-term control (P -F I -F D). This can be used to 
increase the type number, as with proportional plus integral action, but 
at the same time speed up the response. If the compensator is designed 
such that > 4 X 2 ^ 3 , then 'Is transfer function may be rewritten as 

X(] -F sr,)(l + 5T2) 


s 
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Hp[s) 


Poroll el compensat or 
HAs) 


Figure 14 20 Parallel compensation 


where Ti and T 2 are real time constants 


(d) 


H,(s) = 


K(l + sT,Xl + rj 

s(l + s7i) 


which IS proportional plus integral action with phase lead This com- 
pensator can be realized physically using electronic components rather 
better than the three term controller Its action is very similar to that of 
the three-term controller 


(e) 


H (s) = ^(1 + sT, + 

* 5(1 + s%) 


which produces control action similar to that of (c) and (d) but allows 
the numerator term in s to contain complex conjugate roots This is 
sometimes of value in cancelling highly underdamped quadratic lags in 
the plant 

An alternative form of compensation involves the use of parallel elements in 
auxiliary feedback loops (Figure 14 20) Such an arrangement may have 
definite practical advantages over the senes arrangement (D’Azzo and Houpis, 
1966, Atkinson, 1968) Typical parallel compensators H^s) have transfer 
functions and etfects 


(a) 


HJis) = k^s 


This IS called veloc\ty or rate feedback and can be used to raise the error 
constant of a system moderately and raise the speed of response 


(b) 


«c(s) = 


1 +sT 


This IS called transient velocity or rate feedback and can be used to raise 
the error constant of a system greatly and to raise the speed of response 

Hfs) ~ 


(c) 
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Figure 14.21 Combined series and parallel compensation 

This is called acceleration feedback. Positive aeceleration feedback can 
be used to raise the speed of response of a relatively stable system. 

(d) Hfs) - fcjS + 

This is combined velocity and acceleration feedback and when properly 
designed can be used to raise the error constant and the speed of response. 

It should be understood that while series compensators can usually be 
implemented very simply in electrically signalled systems by using cheap 
operational amplifiers in conjunction with resistance-capacitance networks, 
the implementation of parallel compensation usually involves the use of 
expensive transducers such as tachogenerators and rate gyroscopes. 

Sometimes the use of series compensation alone, or parallel compensation 
alone, is insufficient when the performance specifications are stringent. It is 
then necessary to combine the advantages of both ; parallel compensation can 
be used to linearize and speed up the plant and at the same time desensitize it 
to ehanges in parameters and to remove some of the effects of disturbances; 
series compensation is finally added to eliminate offsets in response to commands 
and disturbances and to bring the overall transient response within specifica- 
tion. The combination can be designed to be more powerful than either method 
used alone; the arrangement is shown in Figure 14.21. 

14.10 SENSITIVITY ANALYSIS 

One of the foremost problems in control engineering is that of the reduction 
of sensitivity of the system response to variations in the plant parameters caused 
perhaps by environmental changes, ageing, or by the variations attributable 
to the tolerances allowed to the hardware. Sensitivity is usually defined (Truxall, 
1955) in terms of a sensitivity function Sp which denotes the sensitivity of the 
closed-loop transfer function T{s) to variations in the plant transfer function 
P(s) and which is expressed as 

or _ dTIT 
” dP/P 
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It has been shown by Horowitz (1959) that 
^ 

^ l+Us) 

where L(s) is the open-loop transfer function Thus if the open-loop frequency 
response function L(jcu) plotted on a Nyquist diagram never enters the unit 
circle, centred on the point { — 1, jO), then Sj is never greater than unity and the 
closed-loop system is always less sensitive than the open-loop system Un- 
fortunately design for this condition may well be impossible Corresponding 
sensitivity functions have been derived by Horowitz (1963) for the transient 
response 


14.11 FEEDBACK INSTRUMENTS 

It is interesting to consider the relationship between feedback control systems 
(Figure 14 22a), in which measuring instruments (in the form of transducers) 
are used to facilitate accurate control, with measuring instruments (Figure 
14 22b) (again the form of transducers which are used in feedback loops to 
facilitate accurate measurement) The controller must be designed in exactly 



(a) 


Quantify 
to be 


Difference 

signal 


Indicotion of 
measured 



Figure 14 22 Feedback in control and instrumenialion systems 
(a) the general feedback control system, (b) the general feedback 
instrument 
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the same way in both systems and normally both must contain a series com- 
pensator designed to ensure adequate relative stability and small (preferably 
zero) steady-state error under static conditions. In the feedback instrument 
the object is to null the error signal between the quantity to be measured and 
its physical manifestation. The signal fed out from the controller is then an 
indication of the quantity to be measured. Jones (1977, 1979) discusses examples 
of such arrangements in some detail. 


14.12 COMPUTER-AIDED DESIGN 

Classical design of feedback systems, as explained in Section 14.9, used to 
involve the laborious hand computation and plotting of Nyquist diagrams, 
inverse Nyquist diagrams, Bode diagrams, and root locus diagrams. The 
necessary checks on the time domain performance are also very computa- 
tionally involved and time consuming. Only the invention of the analog com- 
puter (Johnson, 1956; Atkinson, 1972) gave some relief to this latter problem. 

When digital computers became readily available in the early 1960s, off-line 
computation of frequency response functions removed the need for hand 
computation. In the mid-1960’s cathode ray displays were interfaced to digital 
computers and work began on the utilization of this new design aid. Iteractive 
design then became feasible and a new concept in design emerged. This concept 
places the control systems designer in a loop, the object of which is to design a 
control system to a given specification as indicated in Figure 14.23. The labor- 
iousness of the design is removed and the designer is freed to practise his art, 
dealing with the parts of the problem which he does best, that is inventing the 
overall control strategy which he thinks is most likely to be the best in all 
engineering senses. Moreover, a rapid assessment of the time-domain per- 
formance can normally be made automatically and presented to the designer in 
the required graphical form. 


Description 
of mom 


Theoretical control 
system performance 
os indicoted by 
( I ) Nyquist diagram 



Figure 14.23 Control system design using CAD methods viewed as a feedback 

process 
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This automated form of design has not only freed designers from the labour 
of hand computation and graph plotting, it has encouraged them to try new 
schemes and to make their designs much more rugged in terms of sensitivity 
to plant uncertamty 

Computer-aided design is of great general importance m engineering as a 
whole, but IS nowadays the key to rapid and successful feedback control 
systems design There ha\e been seteral conferences on the topic (such as the 
lEE conference on CAD at Southampton in 1969 and 1972 and the lEE con- 
ference on Computer-Aided Control System Design in Cambridge, 1973) 
A journal entitled Computer Aided Design is published quarterly in the UK and 
Rosenbrock (1974) covers computer-aided control system design The availa- 
bility of low-cost microprocessor-based computer systems is an additional 
attraction ensuring the more general application of this powerful tool 


14 13 CLOSED-LOOP SAMPLED DATA SYSTEMS 


14 13 1 Introduction 


The closed loop systems considered in this chapter have all functioned on 
continuous data, however, many practical control systems function on sampled 
data (foe example, process control systems controlled by digital computers) 
Usually the concept of an ideal impulse sampler is used m this work, the sam- 
pling process is defined as the generation of a sequence of impulses at the 
sampling instants with the area or strength of each impulse equal to the ongmal 
signal vnlue at that time (Figure 14 24) The notation x*(t) represents the 
ideal impulse sampled version of the ongmal signal x(i) entering the sampler 
It should, of course, be appreciated nght from the start that the ideal sampler is 
a mathematical artihce and that in practical systems no impulses are actually 
present at any part of the real system However, the artifice provides us with a 
valuable analjTica! tool 


I Impulse 
somprer 


lU) At) 



Analog signol Nctonol tron of 

unit vnpulses 



Conventional symbol 
of impulse sampler 



Notional impulse sampled 
sgnol 


Figure 14 24 Aciion ofa theoretical impulse sampler 
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Continous 
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Figure 14.25 Closed-loop sampled data 
system 

Sometimes we are presented with closed systems which are effectively 
represented by the block diagram shown in Figure 14.25, in which the con- 
tinuous elements have a transfer function H(s). The precise analysis of this 
type of system may be very involved; however, in many practical situations the 
sampling interval 7^ is short compared with the time for a transient oscillation 
of the whole system. Under these circumstances it is possible to replace the 
impulse sampler theoretically by a pure gain of value 1/7^ and analyse the 
circuit by normal continuous theory or simulation. 

14.13,2 The Use of Hold Circuits 

In the arrangement illustrated in Figure 14.25, the impulse sampler is placed in 
the error channel of the feedback system. This is a fairly normal situation in 
control systems; in a properly designed system the main elements then act as a 
low-pass filter which smooths the sampled error so that the output G^if) follows 
the input 0i(t) over the required profile of inputs. 

Even in a system which has been designed correctly and which has a rea- 
sonably high sampling rate, the response will not be exactly the same as that 
of the equivalent linear system. It is often most economical to use low sampling 
rates, in which case the intersample ripple on the output may be very severe. 
In order to make the behaviour of the sampled-data system more like that of 
the continuous system and particularly to reduce intersampling ripple on the 
output, various forms of filter are used between the sampler and the continuous 
elements. The simplest type of filter is the zero-order hold or clamp. The actions 
of an impulse sampler with a zero-order hold are shown in Figure 14.26. The 
output is held at the previous sampled value until the next sampling instant. 
The transfer function of the zero-order hold takes the form; 

1 - expC-sTJ 
s 

It is shown in Stockdale (1962) that the total response of a sample and hold 
circuit approximates on average (ignoring harmonics) to a pure time delay of 
TJ2. This approximation is valid down to quite low sampling rates and allows 
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design to be performed by classical methods m the frequency domain The 
approximation can also be used for approximate transient analysis even at 
quite low sampling rates 

Further improvements in signal smoothing can be achieved by using pre- 
dictive hold circuits which use the past two values to predict (or estimate) the 
slope of the curve to the next sampling instant 

14.133 Frcquenc} Response Analysis of Sampled Data S>stems 

Although for many purposes frequency domain analysis may be employed for 
sampled-data systems using the approximate continuous equivalents described 
m Sections 14 13 1 and 14 13 2, at very low sampling rates, or in cases where 
more accurate analysis is required, it is necessary to use a more accurate 
technique Linvill (1951) developed a formula for computing the frequency 
response of a sampled-data system by making a vector addition of all the 
harmonics produced by sampling 

Given a continuous signal of Laplace transform £(s) then the Laplace 
transform E*(s) of the sampled signal is given by 

£*(s) = i X £(s + jnoj.) 

where n is an mteger and cu, is the angular sampling frequency The frequency 
response is then written in the usual way by substituting s = jco, that is, 

£*(J0)) = ^ E -EOoi + J"to.) 
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Tou (1959) has shown how frequency response loci (i.e. Nyquist diagrams) 
can be constructed using this series and how absolute and relative stability 
can be determined. Further, Atkinson and Allen, (1973) have shown how 
Linvill’s formula can be automated for use in interative computer-aided 
design. 


14.13.4 Time Domain Analysis of Closed-loop Sampled Data Systems 

The method of z-transforms (Chapter 10) provides a basis of time-domain 
analysis and stability analysis of a closed-loop sampled-data system. Its main 
limitation is that it provides information about signal amplitudes at the sam- 
pling instants only. It, therefore, provides no information regarding inter- 
sample ripple. 

An impulse-sampled signal e*(t) has a Laplace transform e*(s) which con- 
tains s in the form exp(s7^); the z-transform is obtained by making the sub- 
stitution z = exp(s7^). The z-transform can be represented as a series 

e(z) = f e(/i7;)z~” 

n = 0 


where « is an integer. 

We can interpret z“^ as a delay operator of 7^ seconds and z~^ as a delay 
operator of 27^ seconds and so on. The summation will take a general form 

e(z) = kg -f kjZ"^ -f k 2 Z~^ -h -f • • • + k„z~" 

Each term in the series contains the amplitude k„ at the sampling instant nT^ 
in the form /c„z“" (See Figure 14.27). The z-transforms for various functions of 
time are given in Table 14.2 for reference. 



Figure 14.27 Illustrating z-transform series 
representation of a sampled signal 
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Table 142 Laplace and z transforms of commonly met time functions 


Timefunction/(r) 

Laplace transform 

z-transform 

Unit step u(f) 

1 

z - 1 

Unit ramp ii(i)i 

I 

? 

T,z 

(-' - 1)^ 

Acceleration 
function ii{i)t^i2 

1 

? 

T‘.:i: + 1) 

2(z - 1)^ 

f 

u(t)- 

n' 

1 

,_0 H’ dj^yz-cxpi-ztz)} 


I 

Z 

u(0f e'” 

s + a 

1 

z - cxp(-ar,) 

TfZ exp( — aT,) 

is + a)‘ 

[z - cxp(-arj]^ 

u(f}Cl - e ") 

a 

[1 - exp(-270]z 

s(s + a) 

(z - l)[z - exp(-*r,)] 

u(r)sin((JnO 

iU„ 

z sin((u„ T,) 


z^ - 2z cos(a)„ rj + 1 

iKr)e‘” sin(u„i) 

w„ 

; eTp(j7i)siB(a)n 7^) 

(S -I- 2)* + 0-$ 

z* expiliT,) — 2z exp( 2 T,)cos(to„ 7^) -I- 1 

u(r)cos{u„ 0 

s 

z[r-cos(o„r.)] 


— 2z cos(£j„ 7^) + 1 

u(r)e"“ cos(cj„f) 

s + a 

2 ~ - zexp( — 27i)COS(cOn7^) 

(s + a)^ - 1 - oj^ 

z* - 2zexp(-a7;)cos(cj„T5) + cxp>(— 22 T,) 


Although the r-transform can be in\eiled m a number of ways, it is generally 
best to expand the expression into a power senes m powers of z~ * by algebraic 
long dmsion The coefficient of c”" corresponds to the \alue of the time function 
at the nth samphng mstant. 

Ragazzim and Zadeh (1952) and Truxall (1955) show how to calculate the 
response of a closed-loop sampled-data control system containing an impulse 
sampler in the error channel (Figure 14 28) They hat e shown that the r-transfer 
(or pulse tranter function) of the system is git en by 

Gjz) 

0,(2) 1 + HG{z) 

It should be noted that HG(z) ¥= H(z)G(z) 
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Impulse 

sampler 



Figure 14.28 Samplcd-daia conlrol 
sy.stcm 


The application of the closed-loop r-transform ean be illustrated most readily 
by an example (Figure 14.29) in which the input is a unit step function and it is 
required to calculate the output. 

4 4 4 

H(s) = = 

s(s -F 1) s s -f 1 

From the table of z-transforms (Table 14.2) 


where aT, = Thus 



4z 

z - exp(-aTs) 


ff(z) = 


2.53z 

I.37z + 0.368 


and so 


0„(z) H(z) 2.53Z 

©i(z) 1 + H{z) z^ -F ].16z + 0.368 


Impulse sampler 
(r. =1s) 



Figure 14.29 Sampled-data system 
for worked example 
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Now 0,(0 = “(0 Hence again using the table of transforms 

©.( 2 ) = ^ 

2.53r z 

©o(-) ' 2 ^ + 1 16z + 0368 z - 1 
2 532^ 

" + 0 162* - 0 7932 - 0 368 

By algebraic long division this expression yields 

©„(r) = 2 53z ' - 0 4062“* + 2 032“^ + 0 339z * 

+ 1 37z * + 08272"*^ + I 06Z-'' + I OOz ® + 

As can be seen from this response, although the system is highly undamped, 
Us output esentually conserges towards the input 


14 13J Stabilit) of Closed'loop Sampled Data S>stems using the Z>transform 
For a sampled-data control system the zuransfer function is given by 

©■><!) G(2) 

©,{r) 1 + HCiz) 

The stability of the system depends on the positions of the zeros of [1 + HCfr)] 
m the s-plane However, the transformation of exp(s^ = z has been made and 
the positions of the zeros can be mapped in the z-plane The mapping, z ~ 
expfsT,), maps the imaginary axis of the 5-plane into the unit circle about the 
origin of the z-plane and the left half of the s-plane into the interior of the unit 
circle (Tou, 1959) A sampled data control system will, thus, only be absolutely 
stable if all the zeros of 1 -f- HG(z) lie inside the unit circle The direct applica- 
tion of this cnterion may often be tedious and the use of a bilinear transform 
z = (1 + >v)/(l — w) maps mside of the aide in the z-plane into the left-hand 
side of the n-plane It is then possible to use the Routh-Hurwitz or the Nyquist 
stability cnlenon directly (Tou, 1959) 

14 13 6 Compensation of Sampled Data Control Sjstems 

Sampled -data systems offer a wider range of possibilities for compensation than 
continuous systems, senes and parallel or senes/parallel compensation of the 
kind desenbed m Section 14 9, using continuous transfer functions, can be 
designed m a similar way using a Nyquist or inverse Nyquist diagram based on 
the Linvill approximation (Section 14 13 3) Alternative strategies involve 
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Figure 14.30 Forms of digital compensation : (a) series digital compensation ; 
(b) series digital with parallel continuous compensation 


series digital compensators alone (Figure 14.30a) or in combination with a 
parallel continuous compensator (Figure 14.30b). 

The digital compensators can be arranged to produce phase-lead for which 
H^(z) = (z — a)/(z -F b) with the zero lying to the right of the pole in the z-plane 
or phase lag for which H^z) = (z + a)l(z — b) with the zero lying to the left 
of the pole. Many other forms of digital compensator could be envisaged which 
can be constructed from hard-wired logical circuitry or by means of software 
implemented on a digital computer (Frederick and Carlson, 1971); in these days 
of very low cost microprocessors, software implementation is clearly very 
advantageous. The arrangement illustrating control and digital compensation 
of a continuous process is shown in Figure 14.31; the error signal is generated 



Figure 14.31 Control and compensation using a digital computer 
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by the computer and is then operated on by a suitable algorithm before it is 
outputed to the plant via a digital-to-anaiog converter (DAC) Frederick and 
Carlson (1971) have described how computer programs for typical algorithms 
can be written 


14 14 NON-LINEAR SYSTEMS 


14.14 1 Introduction 

Although certain non-Iineanties (Coulomb faction and stiction) were mentioned 
in connection with the positional servomechanism m Section 14 l.all the theory 
contained m this chapter so far has been concerned with the analysis and design 
of linear systems In this connection we define linear systems as those which 
obey the principle of superposition (Gelb and Vander Velde, 1968) Although 
linear theory is an indispensable design aid, it must be realized that all practical 
systems do inevitably contain non-lineanties For example, every amplifier 
exhibits saturation so that when the error signal m a control system becomes large 
the control signal driving the plant will be limited m amplitude The result of this 
will normally be that the response time of the system will be longer than pre 
dieted by means of linear theory based on the assumption that saturation is not 
present 

There are many types of non-linearity They are distinguished in Gelb and 
Vander Velde (1968) as explicit or implicit non-lmearities Explicit implies that 
the output of the non-lineanty is explicitly determined m terms of the required 
input variables, whereas implicit implies a more complicated relationship 
between input and output through, for example, an algebraic or differential 
equation Then with explicit non-lineanties we have to distinguish between 
staiic and dynamic forms, in which dynamic implies that the output is related not 
only through the input but also through the denvatives of the input Among the 
explicit, static non-lmeanties we must again divide between single-ialue 
(memoryless) non-lmeanties such as saturation and dead-space, and multiple- 
laCue (memory) non-fmeanties such as hysteresis 

The analysis and design of non-Iinear systems is vastly more complex than 
that of linear systems, e^ery non-lmear system being a mmiature umverse, the 
tools atailable for such work are sparse and inadequate compared with those 
available for handling linear systems 

Although some non-lineanties are intentionally inserted by the designer to 
obtain improted system performance, (see e g Thaler and Pastel, 1962), for the 
most part they are a nuisance, causing undesirable side-effects when the error 
signal IS large (e g the effect of saturation) or when it is small (such as, backlash 
m gears which may cause tick to occur) In some circumstances (as with hys- 
teresis) It may be possible to effectively eliminate the effects of the non-hnearity 
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Figure 14.32 Non-linear element in high-gain feedback 
loop 


by means of a high-gain stable local feedback loop. This procedure is strongly 
recommended where it is possible. The procedure is based on the idea that the 
non-hnear element has a variable static gain K. It is driven from a high-gain 
linear amplifier of gain G and feedback is applied through an element of gain H 
as shown in Figure 14.32. Analysis shows that the transfer function in such cases 
is given by 

gp _ GK 
0; 1 + HGK 

The method relies on making HGK P 1 for aU possible values of K so that 

0; H 

The effects of the non-linearity are thus made quite negligible so long as none 
of the elements saturate. The arrangement is only of use so long as the loop is 
relatively stable, so that proper design of the loop is essential. This is probably 
best performed in the frequency domain using the describing function method 
(see Section 14.14.4). 

It was mentioned in Section 14.1 that stiction can cause stick-slip motion in 
servomechanisms following constant velocity (ramp) inputs. This problem can 
again be overcome by means of a high-gain local feedback loop. In this case 
velocity feedback is used around a motor-amplifier series combination; the 
gain of the amplifier is made very high so that the arrangement behaves like a 
perfect integrator of transfer function l/k^s where is the velocity feedback 
constant. The arrangement is again identical to that shown in Figure 14.32; K 
represents the non-linear motor characteristic H = k^s, and G is the high gain 
of the amplifier. There is usually very little problem in designing this loop to 
have adequate relative stability. 
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14.14.2 Methods of Stud>ing Non-lincar Sjstcms 

There are no completely satisfactory methods of studying non-hnear systems 
Computer simulations (analog, digital, or hybrid) are of great importance, to 
perform a simulation it is, however, necessary to choose a model to represent 
the important characteristics of a system Nevertheless a thorough computer 
study of any non-linear system is an essential final step before the real system is 
built 

Trial and error design based entirely on computer simulation is not advisable 
because it may take far too long and may not give a complete insight into the 
system beha^^our and the strategies which are available The first step in de- 
signing a non-linear system is usually to attempt to linearize the essential non- 
Iinearities for small signals about a working point For example, if we ha\e a 
non-linear device with an input signal x(r) and an output }<l) such that >-(t) = 
kx^(/) then dy/dx = 2ix, Ihvs we are working at some pariicvlar working 
point Xp, then for small departures from the working point, the effective gam 
of the device is 2kXp For working points at values of x greater than Xp, the gam 
will be greater than 2A.Xp and for working points at values of x less than Xp, 
the gam will be less Linear systems design combined with a sensitivity analysis 
(see Section 14 10) and a computer simulation should again prove adequate 

Phase plane analysis (^Vcst, 1953, Atherton, 1975) has provided a very useful 
tool in the analysts of non-linear second-order systems subjected to step (and 
ramp) inputs The technique involves the determination of the response in 
terms of its derivative of output (or error) plotted as a function of its output 
(or error) This plot is called a phase plane diagram, in formulating the relevant 
equations, time is removed explicitly and for many practical non-linearities 
where the non-linearities can be represented by linear segmented characteristics, 
the phase plane can be divided into various regions each of which corresponds 
to motion on a particular linear segment of the non-linearity (see Section 1414 3) 

For higher-order systems, on occasions where u is necessary only to determine 
whether or not the designed system will remain absolutely stable for the entire 
envelope of input signals, the methods of Lyapunov (La Salle and Lefschetz, 
1961) or that of Popov (Aizerman and Gantmacher, 1964) will provide exact 
answers without solving the differential equations 

The method of describing functions (Gelb and Vander Velde, 1968 , Atherton, 
1975) based on the concept of quasi-hneartzation for a given class of input 
signals provides the mam basis for the analysis and design of non-lmear feed- 
back control systems (see Section 14 144) 

14.143 Phase Plane Anal) sis 

Phase plane analysis considers any second-order differential equation of form 
X + /lA + Bx = F 
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where A, B, and F are not necessarily constant. A phase portrait consists of a 
number of phase trajectories in the x- versus x-plane. If we define y = x, then 

dy dy/dt _F — Bx — Ay 

dx dx/dt y 

This equation represents the slope of the phase trajectory in terms of functions 
of X and y. In general B, A, and F may be functions of x and y. To find the phase 
trajectory itself this equation must be integrated; sometimes this can be per- 
formed analytically but usually it is better done numerically using a digital 
computer or graphically using the method of isoclines (Thaler and Pastel, 
1962). 

By way of example, let us consider the simple viscously damped second-order 
sevomechanism described in Section 14.1 but with the additional Coulomb 
friction. Consider the response of the system with a step input and let C be the 
magnitude of the Coulomb frictional torque which always opposes motion. 
The instantaneous accelerating torque in Ke and the retarding torque is 
Fdo + C sign where sign is positive for ^ and negative for < 0. 
Thus applying Newton’s second law we have 

JOg = Ke — F^o — C sign 

Now g = 0i — 00, thus 9o = di — e; also for a step input ddjdt = 0 and 
= 0 under steady-state conditions. It is possible to translate the above 
equation into the error form: 

Je + Fe + Ke + C sign e = 0 

From this equation we can deduce 

Ke C sign e 
~ NJ + F~ NJ + F 

in which N is the slope of the phase trajectory where it crosses the isocline. 
The first term defines the family of isoclines for the linear system whereas the 
second term introduces the effect of the non-linearity. The focal point is changed 
from e = + C/K to e = — C/K as e changes from a negative value to a positive 
value. If a trajectory begins at e = A (where A is equivalent to the value of the 
input step) it transverses through the phase plane as shown in Figure 14.33. 
The slope of the phase trajectory is given by the value of N as each isocline is 
crossed. The determination of the passage of the trajectory through the iso- 
clines to point B when e becomes negative, thence to point D when it again 
becomes positive is a simple matter. Motion ceases at D because the generated 
torque is less than C. 

The phase plane technique can be used to analyse the behaviour of the second- 
order system for a variety of commonly-encountered non-linearities (see e.g. 
Atherton, 1975); the main disadvantage of the method is that it cannot be 
extended to higher-order systems in a satisfactory manner. 
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14 14 4 Tbe Method of Descnhiog FodcOoixs 

In itamost simp’s form thsdsscribingfuccuoo method is an eitension ofordmaiy 
transfer ftiscnon analj'sis to tale into account the e?»:t of single DOD'Imsanties 
m s>'stem» excited b> sinusoidal mpuL It is particularl) useful as a method for 
preictmg the ampbtude and frcquenc> of limit cycles but it can aLo be used to 
assess reUtire stability b> cca\entJODal frequeacj response methods (see 
Section 14J). In essence, the describing fiinction of a non-lmear element is the 
gam of the element m temii of the ratio of ih, furJamerjal corrponent output to a 
smijso*d 2 l mpui of gjven magnitude and frequency TiTiereas the gam of a 
linear element is only es’er frequency dependent* the cam of a non-lmear 
e’ement is always amplitude dependent and maj aL>o sometimes be freqnencs 
dej^denL Here we will confine our studj to amphtude-dependent describing 
frincoons. In order to Ulustiate this point a htlle better let us consider a sery 
common form of non-lineanty, namely saiuratwn. The output is hnearly related 
to the mput for small positire or negati\^ excursions of the mput, but the output 
reaches a hmitmg \alue for large excuisions (see Figure 1434). The response 
of this Eon-lmeanty to a smusoid wfll be smnsoid for signals, a ciipped 
smuiOid for medium size signals, tendmg to a square ware of magnitude KE, 
for reiT large signaK The gnm of the element, based on the ratio of fundamental 
output to mput thus is constant at a value K for sttl-^iTI mputs, beginning to 
decrease as the mput goes beyond £, and eventually trailing off towards zero 
as the fundamental output tends towards its hmitmg value of 4K£^/" as the 
mput tend* towards infinity (see Figure 1435). 
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Figure 14.34 Illustrating a saturation non-linearity 



Figure 14.35 Describing function Cq for saturation 
non-linearity 


For memoryless non-linearities the describing function is purely real, while 
memory-type non-linearities introduce a phase shift, causing the describing 
function to be complex. In general if the fundamental output is expressed as a 
complex operator 

KGcu) = 1^0 cos 0 -F jfo sin 0 

where 1^ is the peak fundamental output and 0 is the phase shift, then the 
describing function Gp is given by 

^ fo cos 0 . . Po sin 0 
(JD = — ^ — + J — ^ 

where is the peak input. 

To derive a describing function we must use the Fourier series representa- 
tion for the output; 

i’o(t) = cos(q)0 + ^2 cos(2a)f) + cos(3a}t) + ■■■ 

-F Bi sin((yf) -F B 2 sin(2(ur) -F B 3 sin(3cot) -F • • • 
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where 

/l^ = - f PoO)cos(Wg) 0 d(wO 
7t Jo 

and 

Bf, =~ { iJ^tysmiNoyt) d(G)r) 

Jo 

The definition of the descnbmg function requires and Afj and for 

// > 1 to be negligible The fact that most control systems contam low pass 
filtering elements which filter out the harmonics to a substantial degree usually 
justifies these assumptions 

The descnbmg function is then gi\en by 

B, A, ^ , 

+ j ~ m coordinate form 

or 

in polar form 

The etaluation of the Founer integrals may be quite simple as in the case of the 
saturation non Imeanty or extremely difficult 
Fortunately the descnbmg functions of some commonly encountered non 
hneanties hate been tabulated by Thaler and Pastel (1962) and Gelb and Vander 
Velde (1968) 

The descnbmg function for the saturation element is given by 
G„ - — [(sin ■ R) + 

7t 

The Njquist stability cntenon may be involved for the determination of the 
stability of systems contaimng a non linear element as shown in Figure 14 36 
The closed loop transfer function of this system is giv en by 

^ Gi0(o)GpG20oj) 

O, 1 + Cj(j<»)GQG2(ja)) 

The Nyquist cntenon is based on the characteristic equation 
1 + G,0a>)GDG2(joj) = 0 

To avoid the need to plot a sheaf of Nyquist diagrams for every value of Go 
we may reform the charactenstic equation as 

G,0a>)G20o>) = — I/Gd 
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Non -linear 

Linear elements elements Linear elements 



Figure 14.36 Non-linear feedback system 

The encirclement of the locus — I/Gq by the function Gj(ja))G 2 (j(y) plotted as a 
polar diagram now indicates absolute instability. Relative stability can be 
assessed by treating the locus — l/G^ as the equivalent of the critical point 
(-1, jO) used in linear systems design. The amplitude and frequency of a limit 
cycle can readily be assessed by the point at which the frequency locus 

Gi(i£u)G2(jcu) 

intersects the locus of — I/Gd- This is illustrated in Figure 14.37 which shows the 
Nyquist diagram of an unstable system that must limit cycle at angular frequency 
cUc where 

/Gi(jm,)G2(jm,) = -180° 

from which co^ may be calculated (generally by iterative trial and error). The 
amplitude of the limit cycle can now be determined from 

|Gi(jm)||GDl|G2(ja))| = 1 

(which is the Nyquist amplitude condition for continuous oscillation). The value 
of Gd can now be determined and the value of 1^ , the signal input to the non-linear 



non-linear system showing — l/G^ (unstable 
situation) 
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element, can be estimated by trial and error The error signal entering the first 
linear element can hence be computed from In the absence of any 

input to the system itself, the error and the output are identical m magnitude so 
that Bo = This is the magnitude of the resultant limit cycle In 

general it is necessary to determine the conditions graphically rather than 
analytically or by iteratise trial and error 
Both Gelb and Vander Velde (1968) and Atherton (1975) have shown how the 
method of descnbing functions can be used to design compensating elements for 
non-linear systems and be extended to treat multiple non-linearities, to handle 
transient oscillations, dual sinusoidal inputs, and random inputs 


14.15 CONCLUDING REMARKS 

Many lengthy textbooks ha\e been written on closed-loop systems so that it is 
hardly surprising that the matenal contained in this chapter merely licks the 
surface of a \ast and rapidly expanding subject The matenal described in this 
chapter is all based on the transfer function approach because this certainly 
offers the designer the most comprehensive set of techniques It should be 
understood however that many authors prefer to integrate the classical tranter 
function approach with the relatively modem state lanable approach In the 
state variable approach the system model is described in terms of n first order 
differential equations, each equation being a separate description of the be 
haviour of a particular state and its connection with the other states and the 
dnviog inputs This form of description allows the equations for the system 
states to be condensed into the form of a single vector x(t) and related to the 
driving input vector, u(t) by the equation 

x(0 = Ax(i) + Bu(f) 

where A and B are coefficient matnces 
The system outputs can then usually be related to the internal states by the 
vector equation 

y(0 = Cx(t) 

where C is another coefficient matru and }(f) is the output vector The solution 
to the first of these matrix equations can be determmed by various powerful 
computerized matrix methods and hence the response >( 1 ) for a given u(r) 
can be determmed quickly and accurately 
Most of the theory associated with state-vanable analysis emerged after the 
1960 IFAC Conference m Moscow at which the concepts of controllability and 
observabibty were introduced (Kalman, 1960) Apart from their application 
in time-domain analysis, state-variable techniques form the basis of the Lyapunov 
stability analysis and of optimal control theory (Ogata, 1967) It can also be used 
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in other design techniques (pole assignment, and pole assignment and de- 
coupling methods) using state vector feedback (Porter, 1969; Power and 
Simpson, 1978). 

More recently (particularly in the UK) there has been a return to frequency 
response methods especially adapted to multi-input/multi-output systems; 
these newer methods are particu}ar]y suited to computer-aided design (Belle- 
trutti and MacFarlane, 1971; Rosenbrock, 1974). 

Returning to the more classical approach, statistical methods of handling 
linear and non-linear systems subjected to random inputs has been completely 
neglected in this chapter. The powerful concepts of probability description and 
spectral analysis have however been introduced in several texts (see e.g. Douce, 
1963). 

In order to obtain greater sensitivity and better accuracy of measurement it is 
likely tl;iat more and more instruments will incorporate feedback. The de- 
signers of instruments of all kinds will, therefore, have to become more familiar 
with the analysis and design of feedback systems. 
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